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Abstract 



This is the second in a series of papers intended to provide a basic overview of some of 
the major ideas in particle physics. Part I [40J was primarily an algebraic exposition of 
gauge theories. We developed the group theoretic tools needed to understand the basic 
construction of gauge theory, as well as the physical concepts and tools to understand the 
structure of the Standard Model of Particle Physics as a gauge theory. 

In this paper (and the paper to follow), we continue our emphasis on gauge theories, 
but we do so with a more geometrical approach. We will conclude this paper with a brief 
discussion of general relativity, and save more advanced topics (including fibre bundles, 
characteristic classes, etc.) for the next paper in the series. 

We wish to reiterate that these notes are not intended to be a comprehensive introduc- 
tion to any of the ideas contained in them. Their purpose is to introduce the "forest" rather 
than the "trees". The primary emphasis is on the algebraic/ geometric /mathematical un- 
derpinnings rather than the calculational/phenomenological details. The topics were 
chosen according to the authors' preferences and agenda. 

These notes are intended for a student who has completed the standard undergradu- 
ate physics and mathematics courses, as well as the material contained in the first paper in 
this series. Having studied the material in the "Further Reading" sections of [40J would 
be ideal, but the material in this series of papers is intended to be self-contained, and 
familiarity with the first paper will suffice. 
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Preface 



Before diving in to the content of this paper we make a few comments about what we are 
doing with this series. The ultimate origin of the first paper [|40| was a series of lectures 
given to REU students at Baylor University between 2003 and 2009. They began with 
short introductions to relativistic quantum mechanics and over time came to include par- 
ticle physics, general relativity, and several other topics. 

I decided to type out the first part of these notes because at the time I was a graduate 
student and I knew I could save time in teaching the material if the students could just 
read it directly instead of waiting on me to write everything up on the board. After 
doing so it was suggested that we "pretty them up" and post them online. I found a few 
other people interested in coauthoring sections I wasn't as comfortable writing, and after 
posting, we received a tremendous amount of positive feedback about the results. 

I had no intentions of continuing the project, but since finishing graduate school (and 
taking a job unrelated to particle physics), I have found that working on notes like these 
provides a great hobby and gives me an excuse to keep thinking about these fascinating 
topics. I originally planned on this second part of the "Simple Introduction" series to 
cover string theory, but after beginning to write found that I was spending too much 
time elaborating on mathematical ideas and the ideas were getting lost in the details]^ I 
therefore decided to make the second part cover some of the geometrical ideas necessary 
to talk about strings, thus pushing string theory to the third part. However, in working 
on part II, I decided it would be fun to a do a full geometrical picture of what was done 
in lllOl, but soon realized that if these notes were to be self-contained for a first year grad 
student this would be too long. So, the "geometry of gauge theories" turned into a two 
part work, and string theory became part IV. But because the first three parts would be 
a reasonably comprehensive introduction to non-perturbative quantum field theory and 
particle physics, why not include a volume on perturbative quantum field theory and 
particle physics (making string theory part V). But if we've gone that far, it would be 
irresponsible not to include a discussion of supersymmetry and supergravity, and so on 
and so on. 

All that was to say that I've decided to make this a long term project as a post particle 
physicist hobby. The positive side of this is that for those who seem to find the writing 
style, content, and approach helpful, there will be a plethora of topics covered by the time 
the project is finished. The negative side is that it will take significantly longer for the 
series to be completed. For those who don't like the writing style, content, or approach - 

^This is the opposite problem most people have with the standard string theory texts - the authors 
assume the reader has a tremendous amount of math and therefore there are too few details. 
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it doesn't matter for them anyway. 

We'd like to again emphasize the point of these notes. As the abstract indicates we 
aren't trying to provide a comprehensive (or even terribly in-depth) treatment of any of 
the topics we cover. The origin of these notes is in the general plight of graduate stu- 
dents in theoretical particle physics who aren't able to understand everything they see 
immediately (I don't know anyone who can do this). After completing the standard un- 
dergraduate courses in a physics B.S., one still has about two years of time consuming 
course work to get through the standard topics. Then there's special relativity, relativistic 
quantum mechanics, quantum field theory, particle physics, gauge theory, general rela- 
tivity, cosmology, astrophysics, gauge theories, conformal field theory, and so on. And 
all that assumes a strong understanding (usually on a graduate level) of real analysis, 
complex analysis, linear algebra, ODE, PDE, topology, abstract algebra, algebraic topol- 
ogy, differential topology, differential geometry, algebraic geometry. Lie groups, complex 
geometry, fibre bundles, characteristic classes, and on and on and onj^ And then you're 
ready to start a text on string theory. This is all extremely frustrating and overwhelming 
for the vast majority of us. The level of formality in the standard texts on the more ad- 
vanced topics makes learning them even more formidable. Of course, graduate students 
can typically go quite a ways without understanding the nuances of each of these topics, 
but a fuller understanding would certainly be beneficial for most of us. 

While there are countless outstanding references on individual topics (and subtopics) 
on this list, gaining a sense of comfort in working with these ideas requires a holistic 
picture of all of them. And unfortunately, to my knowledge there doesn't exist a single 
reference that puts all of this together in a single coherent picture]^ There are many ex- 
cellent math texts on each topic, but physicists often have a hard time reading literature 
written by mathematicians, and the sheer number of different topics, which span count- 
less authors with different styles, agendas, notations, etc., getting that holistic picture is 
rarely far removed from "frustrating and overwhelming", despite the excellence of indi- 
vidual texts. Furthermore, there are many excellent physics texts on each topic, but rarely 
do physicists want to take the time to outline all of the details underlying the topic they're 
writing about. They usually want to emphasize the physics (who can blame them), and 
they have a specific agenda in what they're writing - to teach the specific physical topic 
they've decided to write about. Including volumes of extra math is unnecessary. 

But for those of us who can't make perfect holistic sense of these ideas the first time 
we come across them, getting that holistic picture is difficult. Frequently grad students in 
their early year of grad school are left with a reading list of dozens, or even hundreds, of 
physics and math books and papers which they will have to read, understand, and assim- 
ilate into a single coherent picture in their minds. Again - "frustrating and overwhelming" 
for most of us. This is the root of this series. 

It is my hope that through the course of these notes we will cover, at least at a surface 
level, the majority of these topics. As we mentioned above, these notes won't come close 
to a full treatment of these topics. We warn anyone intimately familiar with any of these 

^As you likely realize, this list is woefully incomplete. 

^We do make special mention of ||37| and HlH , which both come closer than most texts do, and are 
consequently standards on the bookshelf of most theorists. 
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topics that these notes will likely annoy you due to a lack of rigor, precise language, etc. 
Our goal, first and foremost, is to explain as clearly as possible where these ideas come 
from, what motivates them, and how they fit together. When faced with a choice between 
rigor and clarity, between detail and clarity, between precision and clarity, etc., we chose 
clarity. We hope that after going through these notes, someone who has never come 
across these ideas will have a general hook to hang each idea on to get their bearings and 
a context when the idea comes up later. For example, we don't necessarily care if you 
understand every detail of what a harmonic differential form is, but we do want to you to 
be able to relate it to the star operator, Hodge decomposition, cohomology, Betti numbers, 
Euler characteristics, etc., and know what type of information can generally be gleaned 
from them. 

So, again, don't come to these notes expecting a detailed discussion of every nuance of 
these concepts. We wrote with someone who has never heard of a "co-exact form" or the 
"spin connection" before in mind. We expect the reader to get a general picture, and then 
go get more details on a particular topic on their own, either through directed coursework 
or through relevant literature. 

Furthermore, a major purpose of these notes is explanation. A frequent frustration stu- 
dents have with many of the standard texts is the lack of explanation of what certain ideas 
mean. For example, most texts on general relativity will discuss the origin of the Riemann 
tensor (moving vectors around a rectangle, etc.). However, after doing this they define 
the Ricci tensor and Ricci scalar as linear combinations of the Riemann tensor elements, 
completely ignoring the fact that both have their own unique geometrical meaning! An- 
other example is the meaning of a connection apart from the metric connection. Most 
students learn about connections from introductory general relativity or differential ge- 
ometry texts, where almost the entire discussion is related to the metric connection. Then, 
students come across connections on fibre bundles while trying to learn about gauge the- 
ories and have little to no context for how generalized connections work. Authors seem 
to assume some familiarity with them and cite the results with no explanation of mean- 
ing. Yet another example is the modem definition of the energy-momentum tensor as the 
functional derivative of the Lagrangian with respect to the metric. This list goes on an on. 
We have tried to include discussions of each of these ideas rather than passing them by, 
followed by summaries of the main points so the details don't distract too much from the 
big picture. 

We do make the following warning - in we were careful not to introduce a math- 
ematical idea that wouldn't come up in the physics part. This, we believed, worked just 
fine there. However we can't make the same promise about this volume. We will intro- 
duce several mathematical idea^ in the first part that won't necessarily come back into 
play heavily until later in the series. 

Also, we want to make the warning that the graphics for this volume were either 
done using graphical software or by hand with a paint software. The the latter case we 
apologize in advance for the poor quality. 

The general content of this paper is as follows: We begin in chapter [l] with a brief 
overview of some terms that physicists often forget but will come in handy. A cursory 

^For example, Hodge decomposition. Lie groups, etc. 
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read is all that is necessary. Chapter |2] is then an overview of the main ideas surrounding 
differentiable manifolds. The primary concepts are: manifolds, tangent spaces, cotan- 
gent spaces, vector and covector frames, tangent maps and puUbacks, integration, and 
Lie groups. Chapter |3] is then a superficial overview of algebraic topology, where we 
begin with the most conceptually simple but computationally difficult idea (homotopy), 
and move to the most conceptually difficult but computationally simple idea (cohomol- 
ogy)|^ We discuss the relationships between these ideas throughout and relate them to 
the contents of chapter |2| 

Chapter |4] then begins the real meat of these notes - differential geometry. You can es- 
sentially view the contents of chapters |2||3] as providing formalisms and tools for chapter 
Chapter [i] ties together several of the ideas in the previous chapters together to ulti- 



mately allow for a discussion of curvature through the four standard curvature tensors]^ 
Then, we begin the physics part of the paper in chapter |5] with a very brief discussion 
of electrodynamics in various notations, how they relate, and the advantages of each. We 
conclude this with an admittedly superficial discussion of the Aharonov-Bohm affect. The 
primary point of discussion Aharonov-Bohm is to illustrate that electromagnetism takes 
a form where cohomology can be easily discussed, thus illustrating the rich topological 
structure to the theory. 

The final major chapter, chapter[6| is a very surface level introduction to the main ideas 
of general relativity, including how the theory was formulated, what its core equations 
mean, its correspondence with Newtonian gravitation, and a few specific applications. 

We end chapter [6] with a discussion of general relativity as a gauge theory in section 
6.8 This section, along with the final and very brief chapter]^ are the "whole point" of 



this volume as far as physics go. As indicated, we discuss electrodynamics and general 
relativity largely to illustrate the geometry discussion in the first part of these notes. The 



real agenda is to eventually get to general gauge theories. In section 6.8 and especially 
chapter [7| we introduce how the geometry of these notes generalizes to the tools needed 
to discuss gauge theories. 

Finally, we give some idea of where this series is going. The first paper, |l40l, intro- 
duced (mostly non-perturbative) quantum field theory and particle physics algebraically. 
This paper acts as a primer for the geometry needed to reformulate non-perturbative 
quantum field theory geometrically. Part III will then discuss non-perturbative quantum 
field theory and particle physics more fuUyfl 

Then, with non-perturbative quantum field theory discussed in as much detail as we 
feel necessary for this series, part IV will then be a detailed discussion of perturbative 
quantum field theory and particle physics, including discussions of evaluating Feynman 
diagrams, renormalization and the renormalization group, phenomenology, anomalies, 
etc. 

Both the third and fourth parts will likely be structured in the same way as this part - 



^With homology in the middle. 

^This isn't to say that the ideas in chapters 2]|3 aren't important on their own. They will be later in the 



series - it is only in this paper that they play a secondary role. 

''Our discussions are limited almost entirely to the Levi-Civita connection in this paper, but we do our 
best to prepare for how things may be generalized later. 

^We will include things like anomalies, instantons, monopoles, etc. there. 
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a section of mathematics followed by physics. 

Part V will then be an introduction to supersymmetry and supergravity, including 
both perturbative and non-perturbative topics. After that we haven't decided entirely on 
the order - but the likely topics for individual parts will include: conformal field theory, 
introductory string theory, algebraic geometry, and advanced topics string theory. We 
may also eventually include cosmology, astrophysics, and string cosmology. 

We welcome and encourage any and all questions, comments, or corrections. While 
we have proofread both [40] and these notes, many mistakes were found in [|40|| since they 
were first posted, and there is little doubt that despite our best efforts this volume will be 
no different. We encourage any suggestions regarding correcting mistakes of any kind. 
We would also appreciate any comments regarding how we could improve these notes as 
far as pedagogy goes - the point is to be clear. If you find a section particularly vague or 
unclear (or a section particularly clear), please let us know. 

Matt Robinson 

m_robinson@baylor.edu 
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Chapter 1 

Preliminary Concepts 



1.1 Introductory Concepts 

In this paper an the next, our ultimate goal is to recast nearly everything we did in 
Il40ll in a more rigorous mathematical framework. We spent the first paper learning about 
group theory, which was the necessary "bare minimum" mathematical depth we needed 
to introduce the physical concepts we have looked at so far. For what is to come, we will 
need considerably more mathematical detail, especially when we get to String Theory in 
the next paper. We therefore begin this paper by looking at several foundational mathe- 
matical concepts. You likely already have some familiarity with many of them, or have 
seen them in math courses before. Our introduction here will be brief. 

1.1.1 Sets 

The first mathematical object we need is called a Set. A set is a collection of objects 
that do not necessarily have any additional structure or properties. A collection of fruit, 
numbers, people, points in some space, or anything else can form a set. While set theory 
is a very large and very deep branch of mathematics, we will not need to consider sets 
any further in what we are doing, and we therefore move on. 

1.1.2 Groups 

The second mathematical objects we need are Groups. We discussed groups in great 
length in ||40|, and we therefore won't repeat that discussion. We encourage you to reread 
the relevant sections there, or the references recommended in the Further Reading sec- 
tions there. 

1.1.3 Fields 

The third mathematical objects we need is the concept of a Field. A field F is a collec- 
tion of objects {/o, /i, /2, • • •} along with two operations, +, called addition, and i^, called 
multiplication, such that the following hold: 

1) (F, +) is an Abelian group with identity /q. 



11 



2) Uf^EF^f,^f,EF. 

3) fi-^{fj-*^fk) = (/i^/i)*/it. 

4) /,^l = l*/, = /,. 

6) fi ^ i fj + fk) = /i ^ /i + fi-^fk and 
(/i + /i) -*^fk = fi-^ fk + /i * fk- 



Then F is a field. Additionally, if 

7) fi ^ fj = fj * fi we say that F is Commutative. We do not say that F is Abelian 
in this case because F is a field, not a group. 

1.1.4 Vector Spaces 

The fourth mathematical object we introduce is a Vector Space. A vector space V con- 
sists of a collection of objects {v^, v ; , v^, . . .} e V (we are using bold characters instead of 
the "bar" vector notation), called vectors, and a field F with {/o, /i, /2, ■ ■ ■} e -F, as defined 
above such that the following hold: 

1) {V, +) is an Abelian group. 

2) If /i G F and v, e V, then fc^j e V. 

3) fiifj^k) = (/»*/j)vfc. 

4) Ivj = Vii = v,. 

5) /.(v, + Vfc) = /iVj + /jVfc and 

(Vz + V^)/fc = fkV^+fkVj. 

The most familiar example of a vector space is the "physics I" example of a collection 
of objects with "magnitude and direction". In the usual three dimensions, the vectors are 
i, j, k, and the field F is the real numbers R. 

Despite this first example, it is important to realize that although the word "vector" is 
frequently used to mean something with "magnitude and direction", in reality the idea 
of a vector and a vector field is a much more general concept. The following examples 
should make that clear. 

Consider the field of real numbers with vector 1. This (almost trivially) forms a vector 
space. The field of real numbers with the vectors 1 and i also form a vector space (which 
is the same as the vector space with field C and vector 1). 

As a third example, consider some arbitrary linear differential operator, V, and two 
functions 0i and (p2 which are solutions to Vcpi = V(j)2 = 0. Then, any linear combination 
of 01 and 02 will also be a solution. So, 0i and 02 form a vector space with the field R, C, 
or even Z. 

A final example is the set of all x M matrices with matrix addition. 

Now, we introduce a few definitions to make some familiar ideas more precise. We 
begin with a definition you are likely already familiar with, that of Linear Independence. 
The vectors vo,vj,va,... are said to be linearly independent iff 




(1.1.1) 
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For example, there is no linear combination of the vectors 




e2 = 




(1.1.2) 



that is zero unless the coefficients of both vectors are zero. 

Next, we define the Dimension of a vector space. A vector space is said to be A^- 
dimensional if it is possible to find N non-zero linearly independent vectors, but any set 
of N + 1 vectors in the space are not linearly independent. 

Finally, we define a Basis. A basis of an A^-dimensional vector space is a set of N 
linearly independent non-zero vectors. It is then possible to write any vector in the space 
as a linear combination of the vectors in the basis. Because you should already be familiar 
with this from several undergraduate courses, we will not consider examples of these 
ideas. 

1.1.5 Algebras 

The fifth and final mathematical object we introduce is an Algebra. An algebra con- 
sists of a set of vectors V, a field F, and three operations, + (addition), ★ (scalar multipli- 
cation), and o (vector multiplication), subject to the following constraints: 

1) V,F,+, and form a vector space. 

2) If Vj, \j e V, then Vj o v^- e V. 

3) Vi o {vj + Vk) = Vj o vj + Vj o Vfe. 

(vi + Vj) o Vfc = Vj o Vfc + Vj o Vk. 
To make this more precise, we consider a few examples. The first is the set of all 
n X n matrices, with matrix addition, the usual scalar multiplication, and usual matrix 
multiplication. 

As a second example, consider the set of all symmetric n x n matrices. In this case, 
the matrix product of two symmetric matrices is not in general a symmetric matrix, and 
therefore this is not an algebra. However, if instead of usual matrix multiplication, we 
define the vector product of two symmetric matrices to be Vj o = {v^, vy} (anticom- 
mutation), then we do once again have an algebra, because the anticommutator of two 
symmetric matrices is a symmetric matrix. 

As a third example, consider the set of all antisymmetric n x n matrices. This will 
not form an algebra with matrix multiplication because the matrix product of two anti- 
symmetric matrices is not in general antisymmetric. However, if we define the vector 
multiplication to be the commutator, v, o Vj = [vj, Vj], we once again have an algebra. 
Also, notice that this algebra does not have an identity element, in that there is no Ve such 
that Ve o Vj = Vj o Ve = ±[ve, Vj] = Vj Vz. Furthermore, notice that this algebra is not even 
associative. Clearly, it does have an identity with addition + and scalar multiplication 
and it is associative with both + and i^. 

We also have the choice to impose what is typically called the Derivative Rule (syn- 
onymously called the Leibniz Rule, or the Product Rule) 



Vi O {Vj O Vk) = (Vj O Vj) OVk+ Vj O (Vj O Vk) 



(1.1.3) 
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If we once again consider the set of all antisymmetric n x n matrices, imposing the 
Leibniz rule with the antisymmetric matrices A, B, and C gives 

[A, [B, C]] = [[A, B],C] + [B, [A, C]] (1.1.4) 

which has the more familiar form 

[A, [B, C]] + [C, [A, B]] + [B, [C, A]] = (1.1.5) 

When written in this form, the Leibniz rule is called the Jacobi Identity, which is nothing 
more than the Leibnitz Rule. 

Our final definition is that of a Lie Algebra. A Lie algebra is the algebra of antisym- 
metric n X n matrices which obey the Jacobi Identity. It turns out that this definition is 
actually equivalent to the one we gave in the first paper, though we will not delve into 
the details of the equivalence now. 

You can see that in the progression from sets to groups to fields to vectors space to 
algebras, we are adding a new level of structure to form each object. It should be clear 
from the notes proceeding this section that each of these objects is extremely useful in 
physics. 

It may seem that we are being extremely abstract. In some sense we are. But, it will 
be necessary to understand these concepts to at least the depths we have discussed them 
here. There is tremendous depth to each of these objects, but the primary definitions are 
all we will need. We will provide examples and outline properties as we move through 
the remainder of the notes. 

1.2 References and Further Reading 

The contents of this chapter are outlined in any book on abstract algebra. We recom- 
mend IIT3ll for an introductory discussion of each or 1,25 J for a more advanced discussion 
of each. 
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Chapter 2 

Differential Topology 



2.1 Dual Space 

Before we dive into the geometry, we will present some new algebraic concepts which 
will prove vital to geometry. In fact, you will see soon that this section is actually geom- 
etry, though it won't be obvious until later. We are introducing the following ideas with 
an intentional abstraction, which will prove useful when we begin to look at these ideas 
in various specific contexts. 

We begin with the idea of a Dual Space. Let be a (finite) A^-dimensional vector 
space with field Clearly, we can choose some set of basis vectors (as defined in the 
previous section), which we denote ei,e2,...,eAr. We are using subscripts to label basis 
vectors deliberately. 

Now, we define a new space, called the Dual Space to V, which we denote V*. V* 
will also be an A^-dimensional vector space according to the definition in the previous 
section. However, while the elements of V are called vectors (or contravariant vectors), 
the elements of V* are called covectors (or covariant vectors, or 1-forms). Because V* is an 
A^-dimensional vector space, we can choose a set of basis vectors for V*, which we will 
denote , , . . . , e^. Again, we are using superscripts to label basis covectors deliberately. 
Furthermore, a given covector will have a well defined action on a vector (which we will 
discuss later), in which a covector maps a vector to a real number M. We can therefore 
define the basis vectors in V* so as to maintain the desired relationship with V via the 
constraint 

e^(e,) ^ Sj (2.1.1) 

where the parentheses are merely to say that e' is acting on e^. This constraint is sim- 
ply saying a basis covector in a dual space is defined as the covector which maps the 
corresponding basis vector to 1 G M, and all other basis vectors to G M. 

We are only working with the basis vectors (for now) because we are assuming these 
spaces are linear. Therefore, knowing the behavior of the basis vectors tells us the behav- 
ior of any element of the space. 

^Try to resist the temptation to merely picture the already familiar Euclidian vector space, and rather see 
it as a mathematical object as defined above. 
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The best way think about vectors and covectors is this; a covector acts on a vector, and 
the result is in M. To write this in mathematical language: 

V* = {(j) : V ^ R, linear} (2.1.2) 

which reads "V* is the set of all linear mappings which take an element of to a real 
number". 

As an extremely simple illustration of this using familiar objects, let V be 3- 
dimensional Euclidian space, with basis column vectors 

e, = eg = 85 = (2.1.3) 

and general element 

/ a\ 

= aei + be2 + csg (2.1.4) 

where a,b,c E M. The dual space to V will then be the 3-dimensional vector space with 
the basis of row (co)vectors 

= (1 0) = (0 1 0) e'^ = (O l) (2.1.5) 

and general element 

(a b c) = ae^ + be^ + ce^ (2.1.6) 



where again, a,b,c E M. Notice that (2.1.1 1 is satisfied by this choice of basis for the dual 
space V*. 

So, we have a vector space V, and given any vector in this space, we can "act on it" 
with any covector in V*, where "acting on it" in this case means the usual dot product: 

[a b c) \e j = ad + be + cf (2.1.7) 

(where ad + be + cf is clearly an element of M). 

It may seem strange to treat Euclidian space spanned by column vectors and Euclidian 
space spanned by row vectors as two different spaces, but they are in fact fundamentally 
different^ We will see this more clearly as we proceed. For now, recall that we said that 
covectors act on vectors, resulting in a real number. Notice that, following usual vector 
multiplication, the row vector always appears to the left of the column vector (it is acting 
on the vector), as 

(a b c) ( e I e M (2.1.8) 



^Admittedly, the difference is not transparent in Euclidian space. It will be apparent in less trivial spaces. 
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However, switching the order does not result in a real number: 




b c) (2.1.9) 



This is our first indication that V and V* are in fact different spaces. 

We will continue to use the convention we have established that basis vectors have 
lowered indices, whereas basis covectors have raised indices]^ It is important to realize 
that the column vectors and row covectors are only one example of vectors and covectors, 
not the defining example. As the examples in the last section showed there are many 
different types of objects that can be vectors and covectors. 

Incidentally, there is no such thing as a dot product between two vectors]^ Rather, dot 
products can only be defined between vectors and covectors (or synonymously, between 
vectors and 1-forms). This is what you actually have been doing whenever you take a dot 
product - you are mapping vectors to M using the dual space. 

As a second example, consider the vector space where both V and the field are Mj^ 
Clearly, the dual space will be V* = M, because any real number multiplied by another 
real number will be a real number. You can see this as a consequence of the first example 
because a 1-dimensional row vector is the same as a 1-dimensional column vector. 

A third example is Dirac's "bra and ket" space. The kets |\1') are the vectors, whereas 
the bras (\1'| are the covectors in the dual space. And, as you are familiar with, (\I'i|\E'2) G 
M. 

Now, let's say we want to use another basis for V besides Si. We can switch to a new 
basis by simply doing a standard linear transformation with some non-singluar N x N 
matrix So, the new basis vectors will now be 

N 

< = Y.^T)':e, (2.1.10) 

k=i 

Notice that the set of all possible basis vectors we can transform to is the set of all non- 
singular N X N matrices T, which is simply the group GL(N). 



Clearly, if we have transformed into the new basis e'^, equation (2.1.1 1 will no longer 



hold if we use the untrans formed basis for V*. So, we need to transform the covector 



basis as well. The obvious way to do this, while preserving (2.1.1 1, is to use the inverse of 



^If you are familiar with differential geometry and/or general relativity, you are likely more familiar 
with (contravariant) vectors having raised indices and (co variant) covectors having lowered indices. Notice 
that here the indices label vectors /covectors, not the components of a vector /covector Eventually, we will 
be using the standard notation. 

^This is only mostly true. We will find in chapter |4]that a metric provides a way of taking a dot product 
of two vectors. However, it does this by turning a vector into a covector, so our statement here is technically 
true. 

^Recall the definition of vector space given previously. 

^This is not a change of coordinates, but rather a linear change of basis at a point. Such a linear transfor- 
mation may be the result of a nonlinear change of coordinates, but not necessarily. 
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T, or 

N 



Y^iT-'Tke' (2.1.11) 



k=l 



So now equation (2.1.1) becomes (using the summation convention where an upper 



and lower index being the same means they are to be summed) 



e'e,{T-')l{T)\ 
5',{T-^)l{T)l 

l\a I rp\k 



= 61 (2.1.12) 

which is exactly what we would expect. 

So, we have defined vector spaces spanned by vectors which are more or less famil- 
iar objects. Then, we defined dual spaces spanned by covectors (which are also vector 
spaces). The idea is that covectors act on vectors, mapping them to real numbers. Next, 
we will express this idea in a more algebraic way. 

As a final comment, because we have denoted the basis of a vector space as vectors 
with lower indices, in order to preserve the summation convention (where one upper and 
one lower index is summed), we denote the components of a vector with upper indices. 
That way, a general vector v with components f ^, f ^, . . . and basis 8^,6^,... can be written 
as 

V = v'Bi (2.1.13) 

We will discuss the transformation properties of upper and lower component indices to- 
gether shortly. 



2.2 Forms 

Consider some vector space V . We can form the product space of p copies of this space 

as 

V^'P = V ®V ®V ®---®V p times (2.2.1) 

In the previous section, we defined covectors as objects which take single vectors to 
real numbers. These covectors form a vector space, as we saw above. Now that we have 
defined the product space of several copies of a vector space, we generalize the concept 
of a covector as well. We will do so in the obvious way, with one additional rule. 

First, define the vector space PJ'V as the p-linear antisymmetric product of 1-forms 
(covectors) which map V^'^ to M. Or, in more mathematical language, 

K^V = {4> '■ V^^ — >■ M, p linear, antisymmetric} (2.2.2) 
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Saying that IsPV is "p-linear" means that it is linear in each of the p variables;^ ^ 

= a(f){V (g) ■ ■ ■ ^ v^''^ (g) ■ ■ ■ ^ V) + b(l){V ^ ■ ■ ■ (g) v^^^ (g) ■ ■ ■ ^ V) (2.2.3) 

The antisymmetry rule (which may seem strange to demand now, but will prove very 
important later) simply means that the sign changes when any two of the vectors are 
swapped: 

(j){V O ■ ■ ■ (g) v^*) (g) ■ ■ ■ (g) v^^') (g • ■ ■ (g) F) = -(/)( F (g • • • (g v^^') (g ■ ■ ■ (g v^') (g ■ • ■ (g F) 

Clearly, a consequence of the antisymmetry is that if any vector in F®^ is a linear com- 
bination of any others, then 0(F®'') =0. In other words, if the vectors in V^'^ are not 
linearly independent, then 0(F®p) = 0. 

We call any object that carries upper and lower indices like this a Tensor. Simply put, 
a tensor with n upper indices and m lower indices is called a "tensor of rank (n, m)". A 
tensor of rank (n, m) is an object that maps a tensor of rank (m, n) to M. For example, a 
tensor of rank (0, 0) (no indices) is simply a scalar and is already in M. A tensor of type 
(1, 0) is a vector, and a tensor of type (0, 1) is a covector. A p-form is a special type of rank 
(0,p) tensor, namely one that is totally antisymmetric. 

As another example, consider the tensor of rank (2, 1). This will have two upper (con- 
tracovariant) indices and one lower (covariant) index. In other words, it will consist of 
two vectors and one covector. Therefore it will map any object with one vector and two 
covectors to M. 

Obviously, A^V = V* always. And, if p > (the dimension of V) then 0(F®p) = 
because p > N vectors cannot be linearly independent. By convention we denote A^V = 
M. We will refer to an element of A^V as a "p-form". 

Now, we want a way of forming tensor products of p-forms. Obviously we can't do 
this by merely forming the straightforward product because of the antisymmetry require- 
ment. So, we define what is called the wedge product, denoted A, defined as 

A : APy (g APy AP+'^V (2.2.4) 

So, A takes the set of all p-forms and the set of all g-forms to the set of all {p + g)-forms, by 
creating the totally antisymmetric sum of all p-forms tensored with all g-forms. Or, more 

^This notation (0(F®^)) means simply that the covectors of (j) each act on the vectors of V^p. For example 
if p = 2 and V ~ \tp), then an element of A^V could be expressed as = (ip^ \ ® (^"2 1- So, the action of on 
V (g)V would be 

^v)^ ((^ii ® (^^1) (1^1) ® ^-2)) = ((^il^i) ® (^-21^2)) 

*The superscripts on the v's are merely labels, not indicators of contravariance or covariance. Because 
many of our indices will be geometric (covariant/ contravariant) indices, while others won't, we will differ- 
entiate them by putting indices which are merely labels (and therefore carry no geometric significance) in 
parentheses, and the geometric indices without parentheses. Nonetheless, all non-geometric (purely label) 
indices on vectors will be superscripts, and on covectors will be subscripts. 
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specifically, if is a p-form and ^ is a g-form, then A ^ is a + g)-form. This definition 
preserves the antisymmetry. 

To illustrate this consider a 3-dimensional vector space with basis ej , eg, 83 (try not 
to think of these as the standard basis in 3-dimensional Euclidian space, but rather as 
general vectors as defined above - they can be Euclidian vectors, matrices, or anything 
else which obeys the definition of a vector space). We then naturally define a basis for the 



dual space as , e^, , obeying (2.1.1 1. Each e* is a 1-form, and therefore E V = Y* 
Vi, as we already know. This just means that each 1-form e* takes a vector to M. 

Then, we can wedge any two 1-forms together by taking the totally antisymmetric 
sum of each 1-form tensored together. For example, 

e-' A = e-' (g) - (g) e'' 
A e-^ = (g) - (g) e-' 



00 tj) /q ID 6) 

e'^ ^e'^ = e'^ (g e'^ - e'^ (g 



(2.2.5) 



(clearly, A = 0, etc. because e* A e' = — A e*). So for example the element 
ei ig) eg e V ® V will be operated on according to 

(e-' A e^)(ej (g eg) = (e'' (g - (g e-')(ej (g e^) 

= e^(e;) (g 8^(82) -e^(ej) ® 8^(82) 

= 1®1-0®0 = 1- = 1 (2.2.6) 

and 82 (g 8; according to 

(e-' A 8^)(82 (g 8^) = (8-' (g 8^ - 8^ (g 8-')(82 (g 8^) 

= 8^(82) (g8^(8i) -8^(82) ®8^(8i) 

= 0®0-l®l = 0-l = -l (2.2.7) 

We are only discussing the behavior of the basis vectors because of linearity - knowing 
the behavior of the basis vectors tells us how any point in the space acts. If we denote the 
1-forms as 0i, then an element 0i A 0i G A'^V is denoted 02- 

We can then form 0i A 0i A 0i = 0i A 02 = 03 ^ A^V in the same way: 

8^ A 8^ A 8^ = e-' (g 8^ (g 8-^ + 8^ (g 8'^ (g e-' + 8-^ (g e-' (g 8^ 



J ^.5 ^3^^^2^^1 e^®8^®8^ (2.2.8) 



-8 (g 8 (g 8 — 8 (g 8 (g 8 



(clearly, 8* A 8^ A 8^ = etc. because of antisymmetry). So, (2.2.8) will be the only element 
of A^V in three dimensions (up to a permutation with a sign change). 
And arbitrary element oiV i^V i^V will be acted on according to 

(8^ A 8^ A 8^)(8j (g 62 (g 85) = i 

{e^ A 8^ A 8^)(8j (g 85 (g 82) = -1 
{e^ A 8^ A 8^)(8j (g 8; (g 83) = C 

etc. (2.2.9) 
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The idea here is that the 0-f orms 0o form a one dimensional vector space with arbitrary 
element (in three-dimensions) 



a G 



(2.2.10) 



The 1-forms form a basis for a three dimensional vector space with arbitrary element 
(in three-dimensions) 



a e 



(2.2.11) 



(where a,b,c E M). The 2-forms 02 form a three dimensional vector space with arbitrary 
element (in three-dimensions) 



a(e^ A e^) + b(e^ A e" 



c(e^ A e-^' 



(2.2.12) 



(where a,b,c E M). Then, the 3-forms 03 form a one dimensional vector space with arbi- 
trary element (in three dimensions) 



a(e^ A A e^' 



(2.2.13) 



(where a G M). 

So, we have four vector spaces summarized as follows: 



p 


Dimension 


Basis 


Equivalent Total Space 





1 


1 


M 


1 


3 


e' 




2 


3 


e* A 




3 


1 


e* A e-'' A e'' 


R 



The symmetry in the dimensionality of each space is not a coincidence. It turns out 



that 



dim(AP1/) 



m 



p\{N -p)\ 



(2.2.14) 



(you can work out the combinatorics to derive this yourself). Much of this may seem 
abstract, but all we are doing is using wedge products of covectors as basis vectors for 
other spaces. For example we could define 



and then rewrite (2.2.121 as 



A 
A 
A 



(2.2.15) 



(2.2.16) 



By generalizing to a vector space of arbitrary dimension A^, we can easily see that the 
wedge product is 
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1) linear - 0p A (a0g + bcpr) = acpp A 0g + b(j)p A 0^/ where a,b eM. 

2) associative - 0p A (0g A 0^) = (0p A 0,) A (j)r 

3) graded commutative - 0p A 0g = (— A </)p 

A consequence of the third (graded commutative) is that (j)p A (f)p = ii p and p are both 
oddH 

Once we have a basis for A^V, we can expand an arbitrary element in terms of this 
basis. Admittedly, we have been sloppy with the normalization so far, but we were merely 
trying to illustrate the major points, not the details. We will now include the proper 
normalization. 

In general, an arbitrary element of A^V is 

1 ^ 

0P = ^ Yl ^n,i2,...,ipe'^ A 6*^ A ... A 6*'' (2.2.17) 

where the wedge products force cuj^ ..^j^ to be totally antisymmetric in all indices. The 
reason for this normalization can be illustrated in three dimensions. A general two-form 
would be expanded as 

uji2e^ A + ujsie^ A + cu^^e^ A e"^ + cj^^e'^ A H (2.2.18) 

The antisymmetry of the wedge product (e* A = — A e*) demands that cuij = —ujji, and 
therefore 

uji2e^ Ae^ + uj2ie^ ^ = cji2e-' A - co';2(-e-' A e^) H 

= uji2e^ Ae^ + uj2ie^ /\e^ ^ 

= 2cji2e^ A + ■ ■ ■ (2.2.19) 

So we normalize this with a factor of 1/2. You can write out more examples to see that we 
will always have a factor of 1/p!. 

Looking at our three dimensions example above, we know that an element of = M 
will simply be 

00 = (2.2.20) 

where G M. Clearly, there is only one degree of freedom here - uj is one dimensional 
(corresponding to the one dimensionality of A^V in the table above). 
Then, we know that an arbitrary element of A^l^ = V* is 

N 

(t>^ = Y,^^^' (2-2.21) 

i=l 

where uJi is a real number. The index i runs from 1 to 3, so this has 3 degrees of freedom 
(corresponding to the three dimensionality of A^V in the table above). 

^If you have never heard of graded commutative, don't lose sleep about. It is merely defined as stated, 
and knowing the definition is all that is necessary for now. 
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Then, an arbitrary element of IS?V is 



1 ^ 

E 



(2.2.22) 



where ujij G M, and the wedge product forces ujij to be antisymmetric in i, j {ujij = —ujji). 
Because i and j both run from 1 to 3, but uJij is antisymmetric, we once again have three 
degrees of freedom (corresponding to the three dimensionality of K^V in the table above). 
Finally, an arbitrary element of IS?V is 



1 ^ 
3! ^ 



ujijk e* A A 



(2.2.23) 



i,j,k=l 



where uJijk E M, and the wedge product forces it to be antisymmetric in all three indices. 
So, because i, j, and k all run 1 to 3, the antisymmetry only allows a single degree of 
freedom (once you have chosen one non-zero value of ujij^, the rest are determined by 
antisymmetry). This corresponds to the one dimensionality of A^V in the table above. 

Before moving on we briefly consider the transformation properties of (pi. It is clear 
that we can express a form in terms of a basis as in ( 2.2.21[ >, ( 2.2.22[ >, and (2.2.23). On 
the other hand, if the basis is understood, we can drop the explicit reference to the basis 
vectors and merely use indicial notation, referring to uji, Uij, ooijk, etcj^ We must merely 
be careful to note the range of each index and the fact that each element uJi-^,i2,.. 
totally antisymmetric in each index. 



IS 



We saw above in (2.1.101 and (2.1.111 how e* and ej transform under some transfor- 

transforms. Consider cUj G 0i. Now imagine 



mation T. We now want to see how cu, 
an arbitrary covector (in M?) graphed as 



^"This is exactly analogous to the choice to write a normal vector v in terms of a basis (v = ai + bj + ck) 
or in indicial notation (v={v'}, where — a, v'^ ^ b, and v'^ = c). However, if one were merely given the 
vector (1,2,3)^ with no other information, we wouldn't know if this was in Cartesian, spherical, or any 
other coordinates. That is why the basis must be known. 
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If we transform the basis vectors through, say, a 45° rotation without rotating the coordi- 
nates tUi, we will have 



which is clearly a different location in the space. So, in order to return the vector to its 
original location (described in terms of the transformed basis), we must rotate it back 45°: 



So, we can see that if we rotate the basis covectors e* according to (2.1.11 1, we must rotate 
the components uji in the "opposite direction", or by ( 2.1.10[ >. 

This generalizes exactly to (pp, where all indices of uJi^^i2,...,ip transform separately: 



00. 



(2.2.24) 



Finally, by the same line of reasoning it can be seen that the components of a vector 
transform as 



(2.2.25) 



Before moving on, we want to once again emphasize that these are transformations 
of components, not transformations of coordinates. For example, if we are talking about 
tensors in two dimensions, we may do a coordinate transformation from Cartesian to polar 
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coordinates. This is a non-linear transformation that cannot be expressed using a linear 
transformation like (T)*. However, for any vector (or covector) living in this space (or 
its dual), we can rewrite the components of that vector in terms of the new coordinates 
using the linear transformation (T)* at that point. The nonlinear coordinate transformation 
induces a linear component transformation at each point. We will discuss this more later. 

We can summarize what we have said so far as follows: vector spaces have bases with 
lowered indices which transform as in (2.1.101 for some linear transformation T. Dual 
spaces, or covector spaces, have bases with raised indices which transform as in (2.1.11 1. 
An arbitrary element of a vector space (in terms of basis e^, e^, etc.) will have compo- 
nents with raised indices (for example w *). An arbitrary element of a covector/ dual space 
(in terms of basis e^, e^, etc) will have components with lowered indices (for example 
'^h,i2-,ip)- Objects with raised indices (vector space components and covector/ dual space 
bases) transform the same way, while objects with lowered indices (covector/ dual space 
components and vector space bases) transform the same way. The transformation proper- 
ties of each are related by inversion - the transformation of raised (lowered) index objets 
is the inverse transformation as the lowered (raised) index objects. 

Recall the footnote on page 17 where we commented on the raised /lowered index 
convention we are using. The familiar notation where vectors have raised components 
and covectors have lowered vectors is now apparent in our construction. The basis vec- 
tors have "switched" indices (basis vectors are lowered and basis covectors are raised), 
but the components, which are usually what is being discussed in physics, align with 
convention. We are merely building these ideas in a more fundamental/ formal way than 
is common in physics texts. We will be very careful to maintain consistency with our no- 
tation throughout the rest of these notes - we will use raised indices when talking about 
vectors using indicial notation, but lowered indices when talking about the basis of a 
vector space. We will use lowered indices when talking about covectors using indicial 
notation, but raised indices when talking about the basis of a covector/ dual space (and 
indices in parentheses when when they are merely labels and neither is intended) 

To further tie this into familiar concepts, we say anything with raised indices trans- 
forms in a contravariant way (i.e. (2.1.101), and anything with lowered indices trans- 
forms in a covariant way (i.e. (2.1.11)). So, vectors and covector space bases transform 
contravariantly, while covectors and vector bases transform covariantly. If you are famil- 
iar with general relativity or tensor calculus, you have likely seen much of this in the form 
of tensors with upper and lower indices. Tensors with raised indices (T*) are contravari- 
ant, and tensors with lowered indices (Tj) are covariant. Everything that is typically done 
in the tensor formulation translates very naturally to the formulation we are using, with 
the only difference being that covariant tensors of rank p > \ are totally antisymmetry 
(which will be explained and justified later). What is typically not discussed in general 
relativity is the underlying structure of the dual space and its relationship to the tangent 
space: V -.V — > M. 

Admittedly it is not obvious at this point why we want the basis vectors and the com- 
ponents to have opposite indices (raised /lowered). Nor is it obvious how any of this 
relates to geometry. Both of these ideas will be made clearer as we proceed. 
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2.2.1 Exterior Algebras 



So, hPV is a vector space for each p. We will find it useful to combine each A^V into 
one larger vector space. Therefore, we can take a direct sum of all such spaces for a given 
iV-dimensional vector space V. We denote this space AV, and it is defined as 

N 

AV^ = K^V (2.2.26) 

Clearly the dimension of AV^ is the sum of the dimensions of IsPV for each p (for example 
above it was 1 + 3 + 3 + 1 = 8). We can also find the general dimension of an arbitrary AV^ 
(using equation ( 2.2.14[ >): 

N ^ / AT\ 

dim(A\/) = ^ dim(A^\/) = ( ^ ) = 2^ (2.2.27) 

p=i p=i V^/ 

It turns out that the 2^-dimensional vector space hV , along with the wedge product 



A, satisfies the definition of an algebra (cf section 1.1.51. Such algebras are called Exterior 
Algebras (they are also sometimes referred to as Grassmann Algebras). 

We have not spent a great deal of time building the concept of the exterior algebra, 
which (as the name implies) is a purely algebraic idea - not an obviously geometrical one. 
However, as we will see, this exterior algebra will prove to be a powerful geometric tool. 

Before we begin the uphill climb towards a full geometric theory, we will look at a 
few examples of how these simple forms (which we will discuss in for simplicity and 
familiarity in this section) can be looked at geometrically. 

Consider, in M?, the set of all 1-forms (0i, cf. equation (2.2.21 1), where we are assuming 



the basis e^, e^, and e-^ is given. Consider two specific vectors in this vector spacep] 

= ujiG^ + cjge^ 
^2 = uj[e^ + uj'^e^ (2.2.28) 

Then, the wedge product will be 

= ujiuj[{e^ A e-') + uiuj'gie^ A e^) + uj2uj\{e^ A e-') + cu^^^sle^ A e^) 

= - uj2uj[){e^ A e^) (2.2.29) 

where we made use of the antisymmetry of the wedge product to get the last equality. 
Notice that this has the form 

n,An2 = det f"^] '^^^ (e^ A e') (2.2.30) 
and for this reason we can see that the coefficient of (e^ A e^) is the area of the parallelo- 



gram formed by the covectors (2.2.28| 



^^We are temporarily limiting ourselves to two dimensions here for simplicity - we will build to three 
dimensions shortly. 

^^Note that this area is directed, meaning that it can be negative - this is an important property and we 
will say more on this later when we discuss orientation. 
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Now, consider the three dimensional case with 



rig 
fig 



So, we can see 

fi^ A fig 



+ {uj2Uj';i -cj3CJ2)(e^ Ae-^) 



Notice that, similarly to (2.2.30 1, we can write this a^ 

n,An, = deth ^?)(e^Ae^) + detf^^ 



(e^ Ae" 



+ det 



det 



(e' A e' 



UJ2 ^3 

A e-^ e-' A gP A e^^ 



UJi 



002 



Us 



Then, we can write (sparing some tedium) 



(2.2.31) 



(2.2.32) 



(2.2.33) 



Cli A fig A fij 

r / / // / //\ I I II I ii\ , I I II I ii\ \ I 1 ^ 2 a 3\ 



(2.2.34) 



^^Note that this looks very similar to the form of the cross product A x B learned in Ek.M. While this is 
not exactly a cross-product, it is related, and we will discuss the relationship later in these notes. 
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We can rewrite this as 

A fig A fi^ 

u[ u'^ uj's I (e^ A A e"^) (2.2.35) 
u'l oj'i UJ'J, 

which we can associate with the volume of the (three-dimensional) parallelepiped 



spanned by the covectors (2.2.31 ): 




We are finding that wedge products can be written in terms of determinants. This is 
not a coincidence, but rather is part of much deeper geometric machinery. We will explore 
this later in these notes. 

In summary for this section, we see that even in these simple examples of forms in M? 
there are clear geometric interpretations. However, our reasoning here has depended on 
the fact that the space, or manifold]^ we are in is M?. For less trivial spaces, we will have 
more work to do in later sections. We are merely trying to give some idea of how these 
objects can have geometric meaning before branching into a fuller geometric exposition. 

The point for now is to understand the the set of all forms, made of antisymmetric 
tensor products (or wedge products) of the covectors in the dual space, form an algebra 
(as defined in section 1.1.5[ >, where each element contains both algebraic and geometric 



information about the types of objects that can exist in the space. 

2.2.2 Vector Space Valued Forms 

Previously we discussed p-forms which act on vectors in some vector space V , map- 
ping them to M. For example (in two dimensions) the tensor product 

a^^{ei (g) 8;) + a^^{ei ® e^,) + a^^(ea <^Gi) + a^^{e2 eg) (2.2.36) 

(where a*-' G M V z, j) is acted on by the 2-form 

6i2(e' A e^) = &,a(e' -e^0e') (2.2.37) 



^^If you aren't familiar with what a "manifold" is, just replace that word with "space". We will talk about 
manifolds later in our exposition. 
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(where 612 G M) to give 

= bi2a^\l ® 0) + bua^^l ® 1) - 6i2a'^(l ® 1) + 6120^2(0 ® 1) 

= 6i2a^2 _ ^^^^21 ^ ^ (2.2.38) 



In other words, the form (2.2.371 maps the tensor (2.2.36) to a real number. We now want 
to consider the possibility of a form mapping a tensor to an element of a vector space. 
Specifically, we will be working with the vector spaces C, a Lie algebra, or a vector space 
which carries a representation of some Lie algebra (meaning that it transforms under that 
representation of that Lie group). 

Consider vector spaces V and W. V will be the vector space whose dual space contains 
the forms we will be considering (in other words, V means the same thing it did above). 
W will be the vector space the forms will map V to. 

Our notation will be as follows: 

A^(V, W) = {(f) : V'^^ — > W, multilinear, antisymmetric} (2.2.39) 

which reads "Ap{V, W) consists of the set of all multilinear and antisymmetric functions 
which map tensor products on the vector space V to elements in the vector space W. So 
A^(V, W) is simply the set of all linear maps from V to W. 

The best way to think about this is that an element of Ap(V, W) is a vector in W whose 
components are not real numbers, but rather real valued p-forms. To see this assume the 
dimension of W is w, and let a*, i = 1 , ■ ■ ■ w,he a basis for W. Then, we can express any 
G AP{V, W) as 

w 

= ^a>, (2.2.40) 

i=l 

where 0^ G AW. 

As a simple example, consider y = and W = M? and p = 2. Then (using (2.2.401 as 
a guide, we want (pi G A'^V. Then, we choose the standard basis (a^ a^) for ly = M^. So, 
an example of an element of A^(]R^, M^) would b^ 

a^ {b,2e' A e') + a^lc^^e^ A e^) (2.2.41) 

(where 612, C23 G M). If this acts on an arbitrary tensor]^ 

d^\ei ® ej) + d^^ei ® e^) + d^^{e3 ® e^) (2.2.42) 

(where d^^, d^^, rf^^ G M), we get 

{sL^{bi2e^ Ae^) + a^(c52e^ A e^)) (d^^(ej ® e^) + d^^(e^ ® e^) + (^■^^(85 O eg)) 
= bud'^a'il 1)- C23d''8i'{l ® 1) 

= b,2d^^a' - C23d''k' G (2.2.43) 



^^This is not the most general element, just an example. 
^^This is not the most general tensor, just an example. 
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So, the form ( |2.2.41| > maps tensor ( |2.2.42| > to M^. 

Notice that the form (2.2.411 has a problem if we want to define a wedge product 
between these forms. Consider, for example, trying to wedge (2.2.41 1 with, say, 

a^(/,e^) + a^(/5e^) (2.2.44) 

we get 

[§.'{b,,e' Ae') + k'{c2se' Ae')] A [(a^ (/,e^) + a^(/,e^)] (2.2.45) 



This cannot be defined unless we have some well-defined way of multi plying (unit) vec- 
tors in W. If we do assume such a multiplication (denoted a' o a^), then ( 2.2.45[ > becomes 



bufsiSi' o a'^)(e^ A A e^) + cg,/, (a^ o a^)(e^ A A e^) 
[&i2/3(a^ o a^) + C2,h ° )] (e^ A A e^) 



(2.2.46) 



As another example, let W be the Lie algebra of, say, SU (2). This algebra consists of 
three generators T^,T'^ and T'^j^ In this case, the multiplication of vectors in (2.2.461 is 
the commutator. So, (rewriting a* as T*), we have 



[b,2f3[T\T'] + C2ih[T\ T']] (e^ A A e^) 



(2.2.47) 



So, when we generalize to vector space valued forms, we must have a clear way of 
multiplying the vectors in W. 

2.2.3 Fullbacks 



Consider two vectors spaces M and N, of dimension m and n respectively. Now, 
consider a linear map / from M to N: 

f : M — > N linear 

In general, this map cannot be uniquely inverted. Consider the very simple example 
where M = M^, N = R, and f{x,y) = {x + y), where x,y,f{x,y) G M. Given, say, 
f{x,y) = 8, there are an infinite number of choices for x,y which produce this ((8,0), 
(10, -2), etc). 

While we cannot in general invert the mapping /, it turns out that / does create a well- 
defined map from the dual space of N to the dual space of M. We denote this "induced" 
map /*, and say 

: A^A^ — > A^M (2.2.48) 

Note that / mapped from M to A^, whereas /* maps from to M. 

^^We could specify a specific representation, for example the fiindamental representation with the Paul 
matrices, but we will maintain generality for now. 



30 



We define /* as follows: given a tensor product of p vectors in M, which we will 
denote {m^-^\m^^\ . . . , m*^^^), we know that it is mapped to a tensor product of p vectors 
in by / as 

(in(^),m(^),...,m(P)) ^ (/(m(^)),/(m(^)), . . . ,/(m(f))) (2.2.49) 



(where /(m*^*)) is understood to be a vector). Acting on the right hand side of (2.2.49 1 with 
a p-f orm in the dual space of (denoted 0p), it is mapped to M: 

0,(/(m(^)),/(m(^)), . . . ,/(m(^))) ^ M (2.2.50) 

The map /* will, as stated above, map a p-form (pp in the dual space of (denoted A^*) to 
a p-form in the dual space of M (denoted M*), which we denote il)p. We define the exact 
action of /* on 0p G A^* as follows: 



= 0,(/(m(^)),/(m(^)),...,/(mW)) (2.2.51) 

We call /* the puUback induced by /. Admittedly, this definition is cumbersome 
and difficult to follow. Bear with us for a few more lines, and then we will consider an 
example. 

It will be helpful to write out /* in terms of specific bases of M and A^. Again taking 
the dimension of M to be m and the dimension of A^ to be n, let aj for i = 1, . . . , m be a 
basis for M, and let bj for i = 1, . . . , n be a basis for A^. Then let A* for i = 1, . . . , m be a 
dual basis for M*, and let B* for i = 1, . . . , n be a dual basis for A^*. Knowing /, it is easy 
to write out the mappings from the basis for M to the basis for A^, generating an n x m 
matrix /j (where i runs 1 to ri and j runs 1 to m): 

n 

/(a,) = ^//b, (2.2.52) 

i=l 

The components of f* with respect to the dual bases is then given by 

m 

r(B0 = 5^//A^' (2.2.53) 

Due to linearity, knowing how /* acts on the basis covectors tells us how it acts on the 
entire space. 

We now consider an example. 

Let M = and A^ = M^. Then let / be defined by 

/ : c'sii + c^ag ^ {c' + 3c')hi + (c")bg + {c' + 2^)hs (2.2.54) 

Clearly this cannot be in inverted in a well-defined way. But, using ( 2.2.52[ > it is easy to 
write out the 3x2 matrix /j. We have 

a^ I — > bi + b^ 

ag I — > 3hi + bg + 2h3 (2.2.55) 
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so 



'I 3^ 

/;=(o 1 

1 2. 



(2.2.56) 



So, using (2.2.53) we can write out the action of /* on the bases of the exterior algebras: 

/*(B^) = A^+5A^ 
r(B^) = 

r(B^) = A^ + 2A^ (2.2.57) 
To see what we have done, consider the vector 

2aj - 5as e M (2.2.58) 

This will be mapped by / to 

- 7hi - 3h2- 4h3 e N (2.2.59) 

Now consider the 1-form 

2B^ - + B^ G yl^ (2.2.60) 



This will map ( |2.2.58| > to M as 

(2B^ - 2B^ + B^) ( - 7hi - Shz - ihs) 
= -14 + 6-4 = -12 

Now we can use ( 22.57) to induce /* on ( 2.2.60[ >, giving 

/*(2B^ - ^B^ +B^) = 2/*(B^) - ^r(B^) +r(B^) 

= 2(A^ +5A^)-^(A^) + (A' +^A^: 
= 3A^ + 6A^ 

Finally, acting on ( 2.2.58[ > with (2.2.62 1, we get 

(3A^ + 6A^) {2aii - ^a^) = 6 - 18 = -12 



(2.2.61) 



(2.2.62) 



(2.2.63) 



which agrees with (2.2.61 ). 

It is instructive to think about this example in terms of the more general definition of 
/* given in (2.2.51 ). 

Once we know how how /* behaves on A^A^, we can extend it to arbitrary A^A^ using 
the following properties: 

1) linearity: r{a<P, + h<P'^) = ari(j),) + 6/^(0;) 

2) n<PpA<P',) = {r<Pp)A{f%) 

Admittedly, the point of the puUback f* likely isn't clear yet. It will, however, become 
more apparent as we delve further into differential geometry. The important thing to 
remember for now is that if we have two vector spaces M and A^ and a map between 
them, then any p-form on the target manifold can be "pulled back" to a well-defined in- 
form on the original manifold. We are introducing the idea now to have the building 
blocks established when we come back to them. We ask for your patience until we can 
make their clearer. 
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2.3 Differential Manifolds 



2.3.1 Informal Definition 

In the next section, we will begin studying vectors, tensors, and forms that live on 
more general differentia^^ manifolds than M". But before looking at what types of objects 
can live on a general manifold, we must understand what a manifold is. 

To get a feel for what a manifold is, we begin with an informal, more intuitive descrip- 
tion/ definition. 

In simple terms, an ra-dimensional manifold is a space Ai where any point x E ^A has 
a neighborhood that looks like M". It is not necessary, however, that A4 look globally like 
M". For example consider the circle (denoted 5*^). Clearly is a one-dimensional space, 
but globally it (obviously) looks nothing like M}, the real line. However, if you zoom in 
on a small section of around any point x G S^, it does look locally like a small section of 
M}. And the more you "zoom in", the more it looks like M} (that is to say the more "flat" it 
looks). 



Another example is the Earth. From most locations on the Earth, it appears that the 
surface is flat. So it is locally similar to M?. As we know, however, the Earth does not have 
the global structure of M^, but rather of the two dimensional sphere, denoted S"^. 

Another example of a manifold is the two dimensional torus T^. 

^^We will drop the word "differential" and merely refer to manifolds, though we will be working with 
differential manifolds throughout this paper. 
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Locally, it also looks like M^, but globally it has a very different structure. 
On the other hand, the "jfigure-eight" space shown below 




is not a manifold. Every point in this space is locally similar to M} except the point in 
the middle. No matter how small a neighborhood you choose, it maintains the general 
structure of a cross, which is not similar to for any n. 

Notice that a space being a manifold depends on its topological structure, not on its 
geometric structure. This means that it depends on the qualitative shape rather than the 
quantitative shape. To illustrate this, consider a square and a circle. They are topologically, 
or qualitatively, the same thing. They are both one dimensional loops. In fact, circles, 
squares, rectangles, ovals, anything that a circle can be stretched, twisted, etc. into without 
breaking it. 
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A 



Quantities like size, angle, distance, etc. don't matter. Quantitatively, however, circles and 
squares are obviously very different. In this sense squares and circles are geometrically 
different. 

2.3.2 A Slightly More Formal Definition 

We now give a formal definition of manifolds to make the previous section more pre- 
cise. 

An n-dimensional manifold is 
1) A topological space A^J^ 



^^A topological space is a formal mathematical idea, which you are encouraged to read about if you are 
not already familiar. However, because we are doing physics, not mathematics, all of our spaces will triv- 
ially meet the requirements of a topological space (spacetime. Lie group space, etc). You should eventually 
understand these things well, but doing so is not necessary for this paper. 

^"We have set in bold font because it is an n-dimensional vector. However, the index is merely a 
label, not an indication of covariance or contravariance, as indicated by the parentheses. 

^^Saying that the set of all {Ui}'s "covers" A4 means that the union of all of the Ui's is equal to A4: 



In other words, we have split Ai up into a collection of (generally overlapping) sections Ui, and when you 
take the union of all of those sections, you have the entire space A4 . 

^^A map is "homeomorphic" if it is continuous and if it has an inverse that is continuous. The intuitive 
meaning is that two spaces are "homeomorphic" to each other if they are topologically equivalent. For 
example the circle and the oval (and the square and the star) are homeomorphic to each other, and therefore 
one can construct a homeomorphic map between them. 

^■'An "open" subset is a subset such that for any point x E Vi C K", there exists some e > such that 
the n-dimensional ball of radius e around x is entirely contained in Vi. For example, consider the subset 
[0, 1] C M^. This contains the point {1}, and any e-neighborhood around {1} will contain the point 1 + e. 
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5) Given two "sections" of M, Ui and Uj, such that the intersection f/j fi Uj ^ 0, the 
mapp^'0'-'-^^(x) = 0'''^(0^^(^^(x)) (where x G C M" and x is an n-dimensional vector in 
Vj) is smoothj^ 

That definition is quite a bit to take in, so we will spend some time explaining what it 
means. We did our best to relegate secondary mathematical ideas to the footnotes. If you 
aren't familiar with them, we encourage you to read them in the recommended further 
reading, or at least the relevant Wikipedia pages. 

The picture we will be working with is the following: 



M 




We have some space M. that is our n-dimensional manifold. We divide it into several 
sections Ui which will generally each cover a different part of Ad with some overlap. All 
together, the fZ/s cover the entire space M. 

For each of these sections Ui, there is a homeomorphismp^ 0*^'^ that maps points in 
Ui C M. io points in an open subset of a copy of M". For example, if the manifold is 

which is not in [0, 1]. On the other hand, the subset (0, 1) does not contain the point {1}, but rather any 
point of the form \ — 5 where < (5 < 1. Clearly, we can define e = |, and the one dimensional "ball" 
defined by all points p satisfying \ — 5 — €<p< 1 — (5 + eis contained in (0, 1). 

^*the map is not bold because it is mapping a vector x in M" to a point in M., which we do not take as 
a vector 

^^Smooth means that it is infinitely differentiable. To a physicist, this simply means that it is a well- 
behaved function. 

^^We should make a somewhat formal comment at this point. We referred to the ^'-'^'s as vectors above. 
This isn't entirely accurate. There is a formal mathematical distinction made between "points" in M" and 
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3-dimensional, then each point p G M., will have an x,y, and z component (in Cartesian 
coordinates). So, there will be (j)^^^'^, 0*^*^'^ and 0*^*^'^ for every i. 

The fact that this map is a homeomorphism simply means that the section Ui has the 
same topological structure as M". This in turn means that when we choose how to break 
M up into sections (f//s), we must only choose sections that are homeomorphic (topolog- 
ically the same) as a section of M."'. 

The pair {Ui, <p^^^) is called a chart, while the collection of all charts {(Ui, <p^^^)} is called 
an atlas. The section Ui is called a coordinate neighborhood, while 4>^^^ is called the 
coordinate function , or simply the coordinates, of the section Ui. 

So, we have a section oi Ui C ^A that is homeomorphic to some section of M", and 
then some map 0^*'' that assigns each point p G f/j to a point in that section of M". In other 
words, <p^^^ allows us us to label every point in U with a unique n-dimensional point in 

(which is why we call (p^^'^ the "coordinates" of Ui). 

This is the content of the first 4 parts of the definition of a manifold. 

The meaning of the fifth requirement is a bit less obvious. If two coordinate neighbor- 
hoods Ui and Uj overlap, there is some collection of points p E M such that p E UiCi Uj. 
Therefore, the coordinate functions 0^*^ and <p^-'^ will each map p to a point in an open 
subset of M" (cf the picture above), and there is nothing to guarantee that they are the 
same point in M" (in general they will not be the same point in M"). So, start with the 
point p mapped to by either coordinate function, say 4>^^\ in c M". Then, we can take 
this point in Vj and use <p~^^^^ to map it back to p e Uj fi Ui. So, we have gone from Vj to 
A4. Now, we can take the point p and map it back to M", but this time using <p^^^ (instead 
of 0(^^), taking ptoViC M". 

So what we have done is mapped points in Vj C to points in Vi C M". We call this 
composite mapping (from Vj to Vi) It is merely a way of moving points around in 

open subsets of M". In order for 7W to be a differentiable manifold, we demand that it be 
possible to choose our atlas so that i/'*^'^-' is infinitely differentiable for all i and j (if Ui and 
Uj don't overlap then there this condition is met for ■0'^'^^ trivially). 

To make this more clear, we consider a few examples. The simplest example is obvi- 
ously M". We can choose our atlas to consist of the single chart with Ui = M" and (Z)*^^-* the 
identity map. 

As a second example, consider the circle 5*^: 



"vectors" in the vector space M". A vector is an object satisfying the properties in section 1.1.4 A point 
is merely an n-tuple which defines a point in the space. In physics we typically blur this distinction. For 
example we treat the point x=(a;,7/,2)^asa "position vector", when in reality it is the vector extending 
from the origin to the point that is truly the vector. This distinction is, for us, not important. We make this 
comment simply to acknowledge the distinction. While we actually mean that is merely an n-tuple in 
M", we will think of it as a vector in everything that follows. This lack of mathematical formality will have 
no affect on the contents of this paper. 
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Clearly it it a one-dimensional space, so we want to find an atlas that maps coordinate 
neighborhoods to M^. An initial (naive) guess might be to use only one coordinate neigh- 
borhood going from a point on the circle all the way around, as follows: 




We then have a lot of freedom with how we choose (a scalar because this is one di- 
mensional). The natural choice would be to define in terms of the angle around the 
circle: 




so that (j) maps to the interval [0, 27r). But this is the problem. The definition above states 
that the coordinate functions must map points in to open subsets of R". But [0, 27r) is 
not an open interval. So, this first guess doesn't work. 

It turns out that, because of the topology of S^, we cannot define the atlas using only 
one coordinate neighborhood. We must use at least two. One possibility is to define two 
coordinate neighborhoods, each like the one in the naive approach above, with the one 
difference that they are open on both ends. We could allow their "origin" to be at any two 
places on the circle, but without loss of generality we take them to be on opposite sides 
of the circle: 
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Obviously, neither neighborhood alone covers the circle (they are both missing the point 
at their respective origins). But, equally obvious is the fact that their union covers the 
entire circle. For either neighborhood's missing point, the other neighborhood covers it. 

So how do we define 0*^^-' and 0*^^^? We can define them in the same way as in the naive 
example above (except there will be no for either of them). Now every point except the 
two origin points will be in C/i n C/2- So, consider an arbitrary point p in the overlap: 




This will be mapped to | by and ^ by (f)^'^\ More generally, any point on the right 
side of the picture will be mapped to d by 0*^^^ and | + by 0*^^^ Points on the left side 
are mapped to 9 by 0^^^ and | + 6' by 0*^^^. Obviously these maps are linear, and therefore 
smooth (infinitely differentiable). 
Another example is the sphere S'^: 
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It is a two-dimensional space, so we expect the coordinate functions map to M^. Once 
again, the topology prevents us from using only one coordinate neighborhood. Once 
again, however, we can use two: one including every point except the "north pole", and 
another including every point except the "south pole". Each of these neighborhoods can 
be mapped to an open neighborhood of M^. You can write out your own maps 0*^^^ and 
0*^^-' and check to see that they are smooth. 

It would be instructive to figure out on your own how many coordinate neighbor- 
hoods are necessary for the torus (pictured above on page 34). See if you can write out 
the coordinate functions for each patch and show that the xjj'^^s are smooth. 

We should comment that it is also possible to have a manifold with a boundary. This 
allows that some or all of the coordinate neighborhoods Ui are homeomorphic to an open 
subset of M" where one of the coordinates is greater than or equal to 0. We won't be deal- 
ing with manifolds of this type in this paper, so we won't spend any more time discussing 
them. 

The reason we define manifolds in this way, with a set of abstract points and collec- 
tions of charts homeomorphic to subsets of M", is that it allows us to discuss manifolds as 
coordinate system free objects. In other words, we can talk about points in p without hav- 
ing to define a coordinate system. We know that all physics should be coordinate system 
independent, so this formulation is especially useful. The manifold exists apart from any 
coordinate system designation. 

On the other hand, because we have coordinate neighborhoods that map homeomor- 
phically to W\ and because we know how to deal with M", we haven't lost calculational 
proficiency. We can map the manifold to sections of M", and do any calculations we may 
want there, knowing that both the homeomorphism structure and the smooth transitions 
between the target spaces of coordinate functions will preserve any and all structure. 

A few moments reflection should make it clear that this more formal definition is 
exactly equivalent to the informal one given in section 2.3.1 Requiring that each Ui be 
homeomorphic to an open subset of M" is analogous to saying that every point has a 
neighborhood that looks like M". The motivation for demanding that the maps i/?'-*^'' be 
smooth will become apparent when we begin doing calculus on manifolds. 

Also, recall that at this point we have the freedom to define the coordinate functions 
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however we want. We can map a particular Ui to a small (open) n-sphere in or to a 
large oddly shaped subset. Because the map need only be a homeomorphism, you can 
literally choose any open subset of that is homeomorphic to Ui without doing any 
damage to the manifold structure. This leads to obvious shortcomings - how can you 
define distance on the manifold? For example, mapping a coordinate neighborhood of S'^ 
to an open disk in with radius 1 is just as good as mapping it to an open disk in 
with radius 1,000,000,000. Clearly the distance between the points in the Euclidian space 
will be different in the two cases, but that doesn't change the "distance" between them on 
M. 

Also, because we are merely demanding that the coordinate functions are homeomor- 
phisms, they don't really tell us anything about the geometry of the manifold. We only 
have information about its topology. For example, there is no way to differentiate be- 
tween a "geometrically perfect" sphere S"^ and an "egg" manifold (a sphere stretched out 
a bit). 

We will eventually deal with these and other problems when we impose the appro- 
priate structures on . For now, we will merely work with the structure we have, where 
M. is simply a topological idea. 

2.3.3 Tangent Spaces and Frames 



In section |Z2| we talked about tensor fields and forms that exist in W^. This was inter- 
esting, but we want to begin generalizing to less trivial manifolds. This section will begin 
that process. 

Consider some n-dimensional manifold Ai. For every point p in some coordinate 
neighborhood Ui with coordinate functions 0^ mapping to M", we can define the Tangent 
Space at point p as follows]^ 

TpM = {p} ® M" = {(p, v) I V G M"} (2.3.1) 

This means that at the point p we have attached a copy of the vector space ]R"|^ Tangent 
spaces are real linear spaces with addition given by 

a(p, v) + b{p, u) = {p, av + bu) 

You can effectually think of the first element in this notation (the p) as merely a label spec- 
ifying what point in Ai the vector v is associated with. Note that the p in the definition 
of TpM is in M, not a point in an open subset of M" that a coordinate function maps to. 
Defining TpAi this way, in terms of point in Ai rather than in terms of where some 0*^*^ 
maps p, prevents redundancy issues regarding which was used in case p E Ui n Uj. 

For example, consider 5*^ Coordinate neighborhoods on 5*^ are homeomorphic to 
open subsets of M}, so for a given point p, we attach every vector in the vector space R^j^ 

^^In a slight violation of our notation, we are merely labeling the vectors in M" in this definition using 
bold characters. They should be considered to be vectors, not covectors. We will "clean up" our notation 
later. For now, just take them to be vectors in the vector space M". 

^^We are here referring to the point p e M, not to the target point 0^*-' (p) £ M". 

^^Note that the tangent space is all of M", whereas the neighborhoods of M are homeomorphic to open 
subsets of M". 
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In other words, we are attaching a line {M}) to every point. So, each point has its own 
copy of 




As one final example, consider the "open ball" in three dimensions. To see this, imag- 
ine the sphere S"^, fill in the interior, and then remove the boundary of the sphere. In other 
words this is the set of all points (x, y, z) in three dimensions such that + y"^ + < r 
for some radius r > o|^ We only need one coordinate neighborhood Ui for this manifold, 
and cj)^-^^ can merely be the identity map to an open subset Vi C M^. So, because the coor- 
dinate neighborhood is homeomorphic to M?, the tangent space will be M?. So, each point 
will have a copy of M? "attached" to it. 

For reasons made clear by the above examples, the elements of TpJiA are called the 
tangent vectors to the point p. The tangent space associated with each point is the space 
of all directions you can move in in the manifold at the point p. 

However, we make the important note that, mathematically, it is best not to think of 
the tangent spaces as "attached" to in a direct geometrical way, despite the indication 
of the pictures above. The tangent space is a different space than the manifold. 

So, we are working with multiples spaces. First is the manifold M. which is merely 
a collection of points with certain topological properties and has no specific coordinate 
system. M. is then divided into coordinate neighborhoods, each of which are a part of M., 
but are required to be homeomorphic to an open subset of M". Then, for every coordinate 
neighborhood, we have another space, M". There is a separate copy of M" for each coordi- 
nate neighborhood, and they are all distinct spaces both from each other as well as from 
M.. Finally, we have the tangent spaces, which consists of an infinite number of copies of 
M", one for each point in AA. Again, each of these copies of W should be considered dis- 
tinct from each other, M., and from the open subsets of M" which the coordinate functions 
map to. 

Let's consider a simple physical illustration of a tangent space. Imagine a fly con- 
strained to move on the surface of S"^. For some path he moves through, at each instant 
we can assign a position vector, which will be a point p E S"^ with coordinates 0j G M^. 
Also, at each point he will have a velocity, which will be a vector "tangent" to the sphere, 
and is therefore in the tangent space of S"^. Notice that the fly cannot have a velocity vec- 
tor that is not tangent to the point because it is constrained to fly in the space only, and 
therefore the space of all velocity vectors at each point is two dimensional. So, the phase 
space of the motion of the fly will include, for each instant, a point in the space along with 
a vector in the tangent space of that point. 

In the same way, if the fly is constrained to the open ball in M^, it can be at any point 
in the open ball, but it is free to have a velocity vector anywhere in the three dimensional 
space. 

Phase space as in the two previous examples represents one of the simplest physical 
examples of a tangent space. However, we should note an important mathematical point. 
We said above that we should think of the tangent space as a distinct space from the actual 
manifold, not as being geometrically attached to it. The fly examples illustrate this in that 
the vector representing the velocity of the fly (which is a vector in the tangent space) does 

'"'On the other hand, the "closed ball" in three dimensions is the set of all point {x, y, z) in three dimen- 
sions such that a;^ + + < r > 0. One includes the boundary, the other doesn't. Clearly, the open 
ball is an example of a manifold without a boundary, while the closed ball is a manifold with a boundary. 
The coordinate neighborhood of the closed ball that covers the boundary will be mapped to a subset of M.^ 
where one coordinate is greater than or equal to zero. 
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not depend at all on the location of the fly. If we speak in terms of the coordinate functions 
0j and assign coordinates to the location of the fly, there will be a vector in M" (the target 
space of a 0*^*^) which represents the location, and an independent vector in a different 
copy of M" (the tangent space at that point). The two vectors are not related, and for this 
reason we think of the tangent space at a point p being "attached" to the manifold at p, 
but not geometrically dependent on it. 



We write tangent vectors in pairs (as in (2.3.1)) because a particular tangent vector 
refers specifically to a point in Ai. For example it would make no sense to add tangent 
vectors at two different points: 

a{p,^r) + b{q,u) = ? 

There is no natural connection between the tangent vectors at p and q (unless, of course, 
p = q). For this reason, we take TpM and TqM to be completely different vector spaces. 
This is what we meant above when we said that "each point p & AA gets its own copy of 
the tangent space M"". 

However, we can consider the union of all these vector spaces, which we refer to as 
the tangent bundle TM of M: 

TM= [j Tj,M=M®W' = {{p,^r)\p e M, ^r E M"} (2.3.2) 

At this point, we are using the term "bundle" is a loose sense. A tangent bundle is a 
very simple example of a fibre bundle, which we will discuss in more detail later in these 
notes. For now, just think of it as attaching a space (M") to every point in another space 
(M), creating a space of dimension 2n. 

Before moving on we mention one idea that will seem unhelpful and even a little silly, 
but will be extraordinarily useful later. We want to introduce it now so that it isn't entirely 
new when we come back to it. As we said, each point p E Ai has a copy of the tangent 
space M". So reversing this, a given vector in the tangent bundle, {p, v), can be associated 
with a specific point p. We therefore define the projection map vr, which takes a tangent 
vector and maps it to the point in M it is "attached" to. So, 

71 : TpM — > Ai 

(p,v) ^ p (2.3.3) 

Obviously for any two tangent vectors v*^*) and v'^-'^ (with v*^*) ^ v*^-'^), 

7r((p,v«)) = 7r((2),v(^-))) 

because they are both attached to the same point p. 

Also, for any point p, we can use the inverse of vr to recover the entire tangent space: 
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-\p) = TpM (2.3.4) 



As we said above, taking time to specifically define a map that does something so appar- 
ently trivial may seem unprofitable at this point. When we come back to it, however, it 
will be a very powerful tool. 
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Moving on, we have now discussed manifolds, coordinate neighborhoods and coordi- 
nate functions, and tangent spaces. A tangent space at a point is, as we have said, a copy 
of M". And, as you should be quite familiar with, we can represent any point in M" with 
a set of basis vectors. It is then natural to seek a way of finding a basis set for a tangent 
space at a point. The way this is done will prove to be one of the most important and 
foundational results in differential geometry. 

Using our standard notation, considering a manifold Ai with coordinate neighbor- 
hoods Ui and coordinate functions 0^*^, we can take the (^'•*-*'s to be whatever coordinate 
system we want. We could take 0*^*^'^ = x, 0*^*)'^ = y, = z, etc. for Cartesian coordi- 
nates, or we could set 0^*^'^ = r, 0^*)'^ = 9, etc for spherical coordinates, and so on. We can 
choose any coordinate system we want in how we map elements p E A4 to open subsets 
of M". 

So, given a point p with chosen coordinates 0'-*^ (in an n-dimensional manifold there 
will be n components, so <p^'^^ is an n-component vector), we can define the set of vectors 



d 



90(0,. 



P, 



dcj) 



G TpM 



(2.3.5) 



To illustrate this, let M = S'^, and let the <p^^^ map to Cartesian coordinates. Then, 
4>^^^ = {x,y)'^ (because the coordinate neighborhoods of S"^ are homeomorphic to M^). So, 
the vectors are 
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^' \ (9x ' dx 
/ dx dy 
^' \dy' dy 



(P,(1,0)^) 
{P, (0, 1)^) 



(2.3.6) 



for any p E Ui. Notice that the two left most expressions in (2.3.6 1 form a basis for the 

(2.3.7) 



tangent space, so any vector in the tangent space at a point p can be written as 

d 



ve TpM = v'{p,{l,Of) +v'{p,{0,lf) =v' 



90(0 J 



E 



We have dropped the | in 



where the summation convention is in effect and 
the notation for now, but this should be understood as referring to a a specific point. 



The form of (2.3.71 seems to beg for something on the right for these differential oper- 
ators to act on. This intuition is correct, but we aren't ready to discuss what they act on. 
We will get to that later in these notes. 

Notice that the index on the left hand side of (2.3.61 is lowered^] which is consistent 
with our convention that vector components have raised indices (0^*^'^), whereas the basis 
vectors of a vector space have lowered indices. 



The general form (2.3.5 1 will always produce a set of basis vectors which, for any point 



p E M, will form a basis for the tangent space at p. Furthermore, because of the properties 

^^When an object with an index appears in the denominator, the index is "switched". For example the 
upper index in is a lower index in and the lower index in Xi is an upper index in 
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of the coordinate functions, this basis will vary smoothly from point to point in M.. Such 
a smoothly varying basis over the tangent bundle is called a frame on the tangent space 
of the manifold. 

Of course, the basis gjljj-i is entirely dependent on the coordinate functions you 
choose. We used the standard x and y above. But what if we wanted to use a differ- 
ent basis instead? We know from the definition of a manifold that the transformation 
functions i/?^*^^ are smooth. Therefore for any coordinates a;* there will exist some smooth 
mapping to some new coordinates x'^ = x'^{x^) (meaning that x'^ is a function of all of the 
X*). This map, from x to x', is exactly the t/?*^*-'^ discussed above, and therefore we know 
that it will be smooth (infinitely differentiable). So, whereas the original frame is 

using the chain rule and exploiting the knowledge that the i/^^^^^'s are smooth, we can 
write the new frame as 

^ ^ (2.3.9) 



dx'^ dx'^ dx^ 



Note that this is a linear transformation (^ is exactly analogous to {T)l from (2.1.101), 



whereas x'^ = x'^{x^) is not necessarily linear. For example, consider the (nonlinear) 
change of coordinates from Cartesian to polar (in two dimensions): 

x = rco^9 y = rsm6 (2.3.10) 

This is a nonlinear mapping of coordinates, not vectors. In polar coordinates, the frame is 

— and — (2.3.11) 

ox ay 



Using (2.3.9 1 we can easily write the new frame: 



d f dx\ d f dy\ d 

dr \dr J dx \dr J dy 

d \ d f d \ d 

— {rcos9) ]^ + [ -^{rsm9) — 
or J ox \or J oy 

= (cos(-)A + (,to»)| 
d / dx\ d / dy\ d 

m ^ [m)di^\de)dy 

= -ir sin ^) — + (r cos ^) — (2.3.12) 

Ox Oy 
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/IV 
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~7 



d _i_ 




So, the nonlinear transformation (2.3.10) on the coordinates provides the linear transfor- 
mation on the frame , or basis vectorsat each point. 



Incidentally, notice that we can literally choose any non-stngluar matrix ^ to trans- 
form the basis. Therefore, the set of all possible such matrices forms a general linear 
group. And, because we have this freedom to choose any basis we want without changing 
the physics of the system we are describing, we have found a gauge group of the system. 
Simply by the definition we are using for manifolds, a gauge tnvariance has come about 
in our freedom to choose the coordinate functions however we want. This will be used in 



much more detail later, in section 6.8.3 



Moving on, just as the basis vectors transform according to (2.3.91, the components of 



a vector must also transform (cf the argument beginning on page 23 ). We naturally expect 
the components to transform under the inverse transformation as the basis vectors (again. 



see argument beginning on page 23 or equations (2.1.10) and (2.1.11 )). So, if x' 



then the inverse will be 



dxi 



x\x'^). So, the inverse of the matrix 
. So, the components of a vector, w*, will transform according to 

dx'' 



x-'{x }, 

^ will be the matrix 



dx^ 



(2.3.13) 



(we included the p because the components must be evaluated point-wise). 

To illustrate this, consider once again the transformation from polar to Cartesian, 



(2.3.10). Then consider some arbitrary vector with components (in polar coordinates) 
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(1, |). So, the vector is 

d TT d 



Br ' 2 80 



This will appear as on the graph below: 




Using (2.3.13 1, we have 



dx 
dr 



dx 



TT 
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TT 

sin 9 -\ — r cos 9 
2 



(i.f) 
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(2.3.15) 



So the vector in the transformed coordinates will be 

n d d 
2 dx dy 



(2.3.16) 
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which is the same thing we had before. 

So, we have basis vectors which transform with lowered indices, and components 
which transform according to the inverse transformation with upper indices. This 



matches exactly what we said at the end of section 2.2 justifying the convention declared 
there. 

Before moving on to the next section, we make a brief comment about how all of 
this may generalize. A tangent space is built by attaching a copy of M."' to each point on 
a manifold. We discussed the physical example of phase space - every spatial location 
(which is represented by a vector in M") a fly (or any object) is in throughout its trajectory 
has a velocity (which is a vector in another copy of M"), which is the tangent space. 

We also mentioned that tangent bundles are very simple examples of fibre bundles. 
Another way of saying this is that a tangent space is a very simple example of a fibre. 
While there is a more rigorous definition that we will discuss later in this series, a fibre is 
basically a space you attach at every point of another space. In physics, the base manifold 
is usually spacetime. We then attach a fibre at each point which corresponds to some 
physical property of the object that we are interested in. We already discussed the tangent 
space fibre as a useful tool for describing the objects velocity. The idea is that for each 
point on a spatial manifold, there is a point in the fibre which corresponds to a specific 
physical property. 

Another example may be a "rotation fibre". Consider a baseball flying through the air. 
At each point in space, there is a specific location (on the manifold), a velocity (a point in 
the tangent space fibre), and typically the ball has some rotational "spin" around an axis. 
We can therefore attach another copy of in addition to the tangent space. A point in 
this fibre will then correspond to an angular velocity vector. So, we have attached two 
fibres to the space manifold - one for its speed and another for its spin. 

As a more exotic example, consider a helicopter. The base manifold will again be 
three dimensional space, and there will naturally be a tangent space for its speed. We 
could then attach an additional fibre, not of M'^, but an fibre. So, at a given spatial 
point, we have a point in the tangent space telling us how fast the helicopter is moving, 
but we also have a point on which may tell us which direction the helicopter is facing 
at that location. 

While these are extremely, extremely simple examples of fibres and fibre bundles, you 
can see the basic idea. It turns out that the theory of fibre bundles provides an extraordi- 
narily powerful tool through which to make sense of nearly everything in particle physics. 
We only mention these ideas to give you an idea of why we are doing things the way we 
are now. We are building towards something much more general. 

2.3.4 The Tangent Mapping 

Before moving on, there is an important property of mappings between manifolds we 
must consider. Let A^i be a manifold of dimension rii and be a manifold of dimension 
n2. We denote the coordinate functions 0*^*^ on Mi as x\ i = 1,. . . ,ni (dropping the index 
in parenthesis labeling a particular coordinate neighborhood for notational simplicity), 
and the coordinate functions in M2 as y\i = 1, . . . , n2. 
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Now consider a smooth map / between these two manifolds: 

f -.Ml — >M2 (2.3.17) 

It turns out that we can use f : Mi — > to induce a well-defined map Tpf which 
maps from the tangent space at p G A^i to the tangent space at /(p) G M2- In other words, 

f:Mi^M2 ^ Tpf:TpMi^Tf^p)M2 (2.3.18) 

We call Tpf the tangent map at p. 

We find the exact form of this map as follows: Let Q be a homeomorphism from [a,b] C 
M to A^i, such that Q{t) e Mi'i t e [a,b]. Obviously this map is a one dimensional path 
in A^i (in other words it is locally homeomorphic to M). 

Then, use the coordinate functions x on A^i to map the point Q{t) G A^i Vr to M" (for 
simplicity, we assume for now that the entire curve is contained in a single coordinate 
neighborhood). This induces a homeomorphic mapp^ 

Q(r) =x(g(r)) : [a, 5] CM^M" (2.3.19) 

So, Q{r) is a parametric expression for a curve in M". We write it in bold because it is a 
point in the vector space M". 

We can use the curve Q(r) G M" to specify a tangent vector in by taking the 
derivative of Q at a point. If we take the curve in Mi defined by Q{t) to pass through 
p G Ml at, say, tq (so Q{to) = p), then the curve in M" defined by Q(r) passes through 
^{p) G M" at Q(ro). 



^^The reason we are using both Q and Q is merely that we are being careful. Formally, Q maps from [a, h] 
to Ml, which is coordinate free. We are going to want to take derivatives this map and we therefore want 
to put it into a space where we know how to take derivatives, namely M". So, Q maps from [a, 6] to A4i, 
and X maps from A^i to M". We merely define Q as the composite map which takes [a, b] directly to M", 
which we can easily take derivatives of. 
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Now, by taking a derivative of Q at tq, we have a vector in TpMi in the direction of the 
curve: 



dQfr) 

P, 



dT 



e TpMi (2.3.20) 



We can express this vector in terms of the frame induced by the coordinate functions 
X, so that the i*^ component is given by '^^'J^^ It=to" vector in TpMi defined by the 

curve Q is given by 



dQ' d 



dr dx'^ 



(2.3.21) 



(where the summation convention is being used), so the components are ^ with basis 
vectors 7^ at point p G A^i. 

So a curve Q : [a, h] — > M.i determines a well defined tangent vector in TpMi, and 
any tangent vector in TpMi can be specified by some curve. Now, using the map / we 
can map this curve in Ali to a curve in M2- Because / is smooth this will be well-defined. 
In other words, we have the composite map 

Q'(r) ^ /(Q(r)) (2.3.22) 

which maps [a, 6] — > M.2- Then, using the same argument as above, using the coordi- 
nates functions y on we can map this curve from to a curve in M". We call this 
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map 



Q'(r) = y(g'(r))=y(/(g(r))) 



(2.3.23) 




If the point p E Mi maps to the point f{p) e M.i, we can use Q' to define a vector in 
Tf{p)M.2- Namely, this will be the vector 



ciQ'(r) 



dT 



T=TO 



(2.3.24) 



We can write this vector in terms of the frame induced by y 



dr dy^ 



(2.3.25) 



So, with a smooth map / from A^i to M2, any tangent vector in Mi can be defined by 
some curve Q in A^i. This curve will naturally induce a well defined map of the tangent 
vector induced by Q at any point p to a tangent vector mM2 at the point f(p). This map 
is called the Tangent Mapping induced by /. It is also called the pushforward because it 
is dual to the pullback in the sense that the pushforward maps a vector to a vector while 
the pullback maps a form to a form. 
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We consider an example of this before moving on. Consider the manifolds S'^ and M^. 
Working with only one hemisphere of S"^, we will choose spherical coordinates. So a point 
in S*^ (minus one point at a pole, see section 2.3.2 1 will be mapped to some {6, 0). We then 
choose the standard Cartesian coordinates on M^. 

Now we want to define the map /. Obviously it will be easier to work with a co- 
ordinate representation of /. This is done in the natural way: a point p E A^i is 
mapped to f{p) by / and to x(j>) by the coordinate functions. Define the smooth map 
F : x(S'^ — {pole}) — > y(M^) (F is a two component vector in this case because is two 
dimensional) so that 



/ : p I > f{p) 



Y:^{p)^y{f{p)) 



The form of F will be 



•Xj ^ • • • ^ lAJ 



(2.3.26) 



(2.3.27) 



So, working with this "coordinate representation" of /, we can define a map from S*^ 
to M^. Let F tak^ 



X 



y 



cos 6 sin < 
1 — cos (, 
sin 6 sin c; 

1 — cos 4 



(2.3.28) 



This map is smooth and can be easily inverted to 

F-^'^ = e = tan-^ 
6 = 2tan"^ 



-1,2 



\Jx^ 



(2.3.29) 



which is also smooth. 

Now define a curve Q in S*^ as 



T 

■ 2r 



where re [0, tt] 

Now let's consider the point 
will define the tangent vector 



(2.3.30) 

[7r/4, 7r/2) (corresponding to r = 7r/2). In S*^ this 



dQ^ d 



dr 36 
d 



+ 



dQ^ d 



dr d(j) 



09 



+ 2- 



d 



(2.3.31) 



^^This bijective map takes points on the sphere to points on the plane. It is typically called a stereographic 
projection. 



53 



Now we can map the curve Q to using F, giving the curve Q' in \ 



Q'\r) 
Q'\r) 



x{t) 



cos(r) sin(2r) 
1 - cos(2r) 

sin(r) sin(2r) 
1 - cos(2r) 



(2.3.32) 



This will produce the vector 

, d /cos(r) sin(2r)\ d 

dT\ 1 — cos(2r) / dx 

_ _ 3 d _ 1 d 



+ 



d / sin(r) sin(2r) 
rf^ V 1 - cos(2r) 



d_ 
dy 



(2.3.33) 



It would be instructive to graph these vectors in the respective spaces to see that they do 
in fact line up. 

So to summarize, for any smooth map / between two manifolds A4i and J^2, ariy 
vector in TpMi can be mapped to a vector in Tf(j,)M.2 in a well defined way by using the 
tangent mapping Tpf at p. 

As one final (very important) comment for this section, it may seem that having to 
define a curve for every vector you want to pushforward is a bit tedious. It turns out that 
it is possible to define the pushforward without having to refer to a curve. This is done as 
follows. In M.I we have the tangent vector 



d 



T 



dQ' d 



in M 1 being mapped to the tangent vector 



dQ' 



dr 



dr dx^ 



dQ'' d 



(2.3.34) 



Q{to) 



dr dy"^ 



(2.3.35) 



?'(to) 



m.M.2- But, using (2.3.23 ) (but using the coordinate representation F instead of /), we can 
rewrite this as 



Q'(r) = F(Q(r)) =^ 
And then using the chain rule. 



(r) AF(Q(r)) 



T=TO 



(2.3.36) 



Q'{ro) 



dQ" 
dr 



d 



{F\Cl{r))) 



dr 
OF' dQi 

dx^ dr 



Note that we inserted the identity operator 

dQ^ 

dx^ 



(2.3.37) 



(2.3.38) 
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because the are simply spatial coordinates. 

Summarizing this, the tangent map takes the tangent vector 



Q{r), 



dQ 

dr 



dQ' d 



dr (9x* 



(2.3.39) 



Q(to) 



to the tangent vector 



dF dQ^ 



dx^ dr 



T=ro 



dQ^ dF' d 



dr dx^ dy' 



Now consider an arbitrary vector v with components v' = 



dr 



(2.3.40) 



. We can rephrase 



the previous paragraph about (2.3.391 and ( 2.3.40[ > by saying that the tangent map takes 
the tangent vector 



(0(r),v) 



dx' 



(2.3.41) 



to the tangent vector 



dF 

dxi 



■ dF' d 
dx^ dy' 



(2.3.42) 



Notice that the right hand sides make no reference to Q or Q', but does hold for any 
arbitrary vector \ E Mi. 

So, in general, we can say that for an arbitrary vector v'^i aip E Mi, the pushf orward 
induced by / is given by 



dx' 



■ dF' d 
dx^ dy' 



(2.3.43) 



You can approach the / : S"^ — > example above yourself using ( 2.3.43[ > to see that 
you do indeed get the same answer. 

2.3.5 Cotangent Space 

We have now defined manifolds and tangent spaces. Following what we did in section 



2.1 we now seek to define the dual space to a tangent space, which is called the Cotangent 



Space. The primary difference between what we did in section 2.1 and what we will do 
here for arbitrary manifolds is that everything is done point-wise. Just as each point 
p E M has its own copy of the tangent space W, each point also has its own copy of the 
cotangent space W*. 

To be clear, the cotangent space of a point has the same "vector space" structure as 
the tangent space. You can picture covectors in cotangent spaces just as much as you can 
picture vectors in tangent spaces. The difference is that one is the dual of the other (cf 
section [ZT| 
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In other words, for each point p e M, define the vector space A'^TpM as the g-linear 
antisymmetric product of 1-forms (covectors) in the cotangent space at p which map 
(TpA^)®"^ to M. Or, in more mathematical language, 

A'^TpM = {(j) : (TpA^)®^ ^R, q linear, antisymmetric} (2.3.44) 



All of the properties we discussed in section 2.1 will hold point-wise in this case, including 



the formation of the exterior algebra (at each point p), wedge products (at each point), 
vector space valued forms (at each point), transformation laws between covector bases, 
etc. 

However, we need a frame to provide a basis for A'^TpM locally just as g^fyj (p) formed 

a local frame for TpA4 locally. For a given set of coordinate functions 0^'-' (or components 
(0(*)'^, . . .)^), we have a frame for the tangent space TpAi given by 

d 



The basis for the cotangent space will then be given by the differential 

d(j)^'^'^ (2.3.45) 

such that (cf ( |2XT] >) 



The expression dcp^'^^'^ should be taken to be a differential (like the things you integrate 
over in Calculus I) just as much as „ .f^)^ is a derivative operator. We will talk about the 



connection between (2.3.451 and the differential from in calculus soon. This relationship 



is, in many ways, at the heart of differential geometry. 

All of the properties of forms discussed above in section 2.2 will hold true here. The 
biggest difference is that differential forms are defined only at a single point. 

As a quick example, consider a manifold with coordinate neighborhoods homeomor- 
phic to M^. If we set the coordinate functions to be Cartesian coordinates {<p^^^={x, y, z) ^), 
then the dual space basis covectors are dx, dy, and dz. So an arbitrary element of A'^TpM 
will simply be a real number. An arbitrary element of A^TpAi will be 

a{dx) + b{dy) + c{dz) (2.3.47) 

An arbitrary element of A^TpM. will be 

a{dx A dy) + b{dx A dz) + c{dy A dz) (2.3.48) 

An arbitrary element of A^TpM. will be 

a{dx Ady A dz) (2.3.49) 
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The action of, say, an arbitrary 2-form on some arbitrary element of TpAi ® TpJ^ is thenp^ 

{a{dx A dy) + b{dx A dz) + c{dy A dz)) (^A ^ (g) + ^ 
= bA-cB eR (2.3.50) 



d_ d_ 
dz dy 



And just as in section 2.2 we can express an arbitrary element of A'^TpM. as 



ql 



d(t)'^ A 



A ■ ■ ■ A 



(2.3.51) 



(compare this to (2.2.17)). Of course the summation convention is in effect. 

So, for some change of coordinates, how will the covector basis transform? Again, 
using the chain rule, the transformation law will be 



dcj)'- 



(2.3.52) 



which is the same transformation law for the components of vector as in equation (2.3.131 

wil 

de 



For example, the change of coordinates (2.3.10 1 will result in the cotangent space frame 

dr 



being transformed (linearly) to 



dx 



dy 



and 



dx , dx ,^ 

{cos 6)dr — {rsm9)d9 
dy, , dy 

d-/' + de'^^ 



= (sin 6)dr + (r cos 6)d6 

We can also write out the wedge product in terms of the new basis 

dx Ady = ((cos 6*) (ir — {r sin 9)d6) A (^{sin 9)dr + (r cos 
= (r cos^ 9 + r sin^ 9) dr A d9 
= r dr Ad9 



(2.3.53) 



(2.3.54) 



We recognize both sides of this expression as looking a lot like area terms in their respec- 
tive coordinate systems. 

2.3.6 The Fullback of Differential Forms 



Now we can generalize what we did in section 2.2.3 to differential forms. We will 
follow what we did there fairly closely. You are encouraged to go back and re-read that 
section in parallel with this one. 



34' 



This is not the most general vector - just a random example. 
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Consider some map / from manifold Aii (of dimension ni) to manifold A42 (of di- 
mension 71,2): 

f:Mi~^M2 

In general this map cannot be uniquely inverted. 

But, while we cannot in general invert the mapping /, it turns out that / does create a 
well defined map from the cotangent space of Ai2 to the cotangent space of A4i. We denote 
this induced map f*, and say 

r ■■ ^'Tf^p)M2 A'TpMi (2.3.55) 

Note that / mapped from 1 1-^ 2, whereas /* maps from 2 1-^ 1. 

We define /* as follows: given some tensor product of q vectors in TpAii, which we 
will denote (m^^\ m.^^\ . . . , m*^"^)), we know that it is mapped to a tensor product of q 
vectors in Tf(.p^M.2 by the tangent mapping Tpf as 

(m(^), m(^), . . . , m(^)) ^ ( r,/(m(^)), r,/(m(^)), . . . , r,/(m(''))) (2.3.56) 

(where Tp/(m(*)) is understood to be a vector in Tf(p)M.2)- Acting on the right side of 
(2.3.56) with a g-form in A'?Ty(p)A^2 (denoted 0g) maps it to M: 

(/),(T,/(m(^)), r,/(m(^)), . . . , r,/(m(''))) ^ M (2.3.57) 

The map will, as state above, map a g-form (j)q in AT/(p)7Vl2 to a g form in MTpMi, 
which we denote ipq. We define the exact action of /* on 0q G h.'^Tf(p)M.2 as follows: 

(/>g)(m('),m(^),...,m(^)) = ^/^^(m^'), m^^), . . . , m^^)) 

= 0,(T,/(m(^)), r,/(m(^)), . . . , r,/(m(''))) (2.3.58) 



The remainder of the analogous discussion in section 2.2.3 will hold for the meaning 
and application of (2.3.581. 

One very important property is that, in the case where g = (zero forms, or real 
functions), the puUback "commutes" with the action of a one form: 

(r0o)(x) = 0o(/(x)) (2.3.59) 

As an illustration we calculate an example. Once again take f : M.i — ^ M.2, where 
X are the coordinate functions on M.i and y are the coordinate functions on M.2- The 
coordi nate re present ation o f /, again denoted F : ^{Mi) — > y{-M.2) will be used as in 
section 2.3.4 Using (2.3.431, a vector v^i^^ in Mi is mapped to a vector ^■'ffj^ in M2- 



We want to find the pullback of the one-form dy^ G Is}Tf(^p)M2, which will be a one-form 
{rdy^) G h^TpMi. Using the definition ( |2.3.58| >, we have 

= dy-'v 



dx^ dy^ 

Qpk 
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Or in terms of the basis vectors 



d_ 



(2.3.61) 



The meaning of this is as follows: {f*dy^) is a form in (the x space), and will therefore 
be some linear combination of the dx^'s. We will call it 



(2.3.62) 



for some Al (proportionality terms). If we act on a unit vector ^ with this (using (2.3.61 1), 
we have 



_d_ 
dx'' 



Mdx' 



d 
dx^ 



MSI 



^ ~ dx^ 



(2.3.63) 



So, using (2.3.62), 



{fdy^ 



dx^ 



dx" 



(2.3.64) 



In other words, (2.3.64 1 is telling us how to express a form in y space in terms of forms in 
X space. 

You are already quite familiar with all of this, even though that may not be apparent. 
What we essentially have is some space with coordinates x, and then a map which assigns 
a point the space with coordinates y for every point in x. We call this map F, and it has 
the form 



y\x\x\...) = F\x\x\...) 



(2.3.65) 



(note that this is a map from x space to y space). We can then take the differential of this 
expression, getting 



OF' 
dxi 



dx^ 



(2.3.66) 



which is simply a slightly less formal statement of (2.3.641 - relating the expression of 
forms in x space (A^i) to y space (7^2)- We have done nothing more than formalize the 
partial derivative. While this may seem like a lot of work for such a simple concept, 
remember that we are going to be doing calculus on much more general spaces than M", 
and therefore we will need all the formalism we are currently developing. 

2.3.7 Exterior Derivatives 

We are now ready to discuss an extremely important idea that will provide a major 
part of the the framework for physics. The Exterior Derivative, which we denote d, is a 
function which maps g-forms to {q + l)-forms: 



d : A'^TpM 



(2.3.67) 
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We will begin with the simplest example and build the general definition from there. 
We will work on a manifold Ai with coordinate functions x. Starting with an element 
00 G A^TpAi (a zero-form, or merely a real function), we define the exterior derivative as 

- -^"^'dx^ (2.3.68) 



dx' 



Note that if we transform the coordinate covector basis basis (using (2.3.9) and (2.3.521), 
we have 

^ ^ and dx^ - ^^df (2.3.69) 



So 



dx^ dx^ dy^ dy^ 



dx'' dy^ \ dx^ dy 

- dy^ ^'"^y 



^dy^ (2.3.70) 



dy^ 

which is the exact same form as in the x coordinates. So, the definition of the exterior 
derivative is coordinate system independentl That is an extremely important property which 
will prove extraordinarily useful later 

f TA70 nar^ r\n a iror'f/Tr -ir — -jiJ - 



Because d(j)o is a (0 + 1) form on M, we can act on a vector ^ = ^ to get 



= A = (2.3.71) 

This expression is interpreted as being the derivative of (po in the direction of v, or the 
directional derivative of 00 . We will discuss the geometrical meaning of this (and other 
results) shortly, though you should already be somewhat familiar. 

Now we generalize the definition of d. We will take d to have the following proper- 
tiesS 

1) Leibniz - d{fg) = fdg + gdf, where /, g are functions. 

2) Linear - d{af + bg) = adf + bdg, where /, g are functions. 

3) d{A''TpM) C Ai+%M 

4) Generalized Leibniz - A ipq) = {d(j)p) /\'4>q + (— l)P0p A {dipq) 

5) Nilpotent - d'^4)p = d{d(f)p) = for any p-form (pp. 

With a little tedium, it can be shown that the exterior derivative of an arbitrary p-f orm 



(cf (2.3.51)), is 



^ dx^ A dx'' A dx'-' A ■ ■ ■ A dx'^ (2.3.72) 



dx^ 

^^Each of these can be proven very easily and straightforwardly, and we therefore won't write out the 
definitions. You are encouraged to convince yourself that each of these properties are true. 
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(Check the indices very carefully!), where the summation convention is, as always, in ef- 
fect. The wedge product will ensure that this new {p + l)-form will still be antisymmetric. 

Once again we can see that this (2.3.721 is coordinate system independent. 

The nilpotency of d: 

= (2.3.73) 

is an extremely important property that will be discussed in much, much greater depth 
later in these notes as well as throughout the rest of this series. 

Also, consider a map f : Mi — > M.2- Then let {TpM.i)®'^ be an arbitrary tensor 
product of q vectors in TpM.i. We can then take the exterior derivative of the induced 
pullback: 

d{r(t^,){{TpMiD = {d<p,){Tpf{TpMir'^) 

= r{d<p,){{TpMir'^) (2.3.74) 

So the exterior derivative and the pullback commute with each otherj^ 

Before moving on we will consider a few examples of exterior derivatives in the usual 

three-dimensional Euclidian space we are familiar with. 

Consider a real function / in M'^. This is just a zero-form, so the exterior derivative will 

be as in ( |2.3.68| >: 

df = -J—dx^ = -J-dx + -J-dy + -J-dz (2.3.75) 
ax* ox ay oz 

This is the exact expression for the gradient of the function /, usually denoted V/ (re- 
member that the dx' should be thought of as unit vectors in the cotangent space, which 
is simply a vector space - so fix' is analogous to the row standard basis unit vectors. So 
( |2.3. 75[ ) would be written as 

dx dy dz 

in the more familiar vector calculus notation, where and k are understood as row 
vectors). 

Next consider the two form 02/ 

02 = 4>x dy A dz + (j)y dz A dx + 02 dx A dy (2.3.76) 

Taking the exterior derivative, 

^ + ^ + ^)dxAdyAdz (2.3.77) 
ox ay oz J 

If we recognize 02 as a vector in a three dimensional space, this expression is its diver- 
gence. 



''^This isn't a proof, but it illustrates the point well enough. A more formal proof is a straightforward 
though tedius application of the chain rule. 
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Now consider the one-form 0i 

01 = (p^dx + (pydy + (pzdz (2.3.78) 

Taking the exterior derivative, 

^ ^dy Adz + { ^ ]dz Adx + ^ ]dx Ady (2.3.79) 



dy dz J \ dz dx J \ dx dy 

Which we recognize as looking a lot like the curl of the vector field 0i. 

These examples allow us to see that, in three dimensions, the nilpotency of d {d'^cpo = 0) 
is equivalent to V x (V0p) = 0. And, d'^(f)i = is equivalent to V ■ (V x 0; ) = 0. 



As a final comment, it probably isn't clear how "unit vectors" like dy A dz in (2.3.791 
relate to the unit vectors you are used to seeing from vector calculus. We will discuss this 
relationship later. 

2.3.8 Integration of Differential Forms 

As we mentioned above, a form dcp is very much a differential, just like the things 
you integrate over in introductory calculus. We have now come to a point where we can 
discuss how forms are integrated. 

However we must first discuss the notion of orientation. As an example to motivate 
this discussion, consider the Mobius Strip M 
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We can treat this as a manifold (with a boundary - don't worry about this detail for now) 
and put the usual x and y Cartesian coordinates on it. 




Now let's say we want to integrate some real function / (a 0-form) over the entire surface 
of Mobius Strip (we are taking / to be a function of an abstract point on the manifold so 
it is f{p) rather than f{(t>'^^{p))). 

I f dxdy 
Jm 

But consider moving the coordinate system (the x and y axes) around the strip: 
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After returning to where we started the coordinate system has reversed orientation - it 
started as a right handed system and has ended up a left handed system. There is clearly 
no way to rotate the axes back to their original configuration without rotation out of the 
space or going all the way back around the strip. 

So, because moving around Ai has changed the orientation of the strip, and because 
there is no way to transform the new coordinates back to the old coordinates, the Mobius 
Strip is an unoriented manifold. Obviously this ambiguity will cause problems for inte- 
gration - at any point there are two inequivalent coordinate systems to choose from, and 
we don't have any well defined way of choosing one. Of course, we could choose a single 
coordinate neighborhood on M, and the coordinate system would be well defined, and 
the integral would make sense. 

The standard cylinder C 
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will not have any problem with orientation. Moving the coordinate axes around this 
brings them back to where they started. Therefore C is an oriented manifold. 

Other examples of oriented manifolds are the n-sphere, the ra-torus, M". Examples of 
non-oriented manifolds are Mobius Strips, Klein Bottles, real projective planesj^ 

Now we want to state this in a more mathematically well-defined way. In order to do 



this, we must finally discuss the brief comment we made on page 28 about the observation 
that wedge products are expressible in terms of determinants. Given some n-dimensional 
manifold with some particular coordinate neighborhood Ui and coordinate functions 
<p^^^ = X, we know that A'^Tp^A is one dimensional (cf equation (2.2.141). So, an arbitrary 
element of A"TpX can be expressed as 



f dx^ Adx^ A--- A dx" e A"TpM 



(2.3.80) 



where / is some real function. 

Now let's assume p is also in the coordinate neighborhood Uj with coordinate func- 
tions y. We can easily form the linear transformation from x to y (2.3.52). However, 



because A"TpA^ is one dimensional, and therefore the transformed ra-form will be pro- 
portional to the untransformed n-form. We denote the constant of proportionality the 



you aren't familiar with these manifolds that's not a problem. We won't be discussing them further, 
though you are encouraged to reference them on your own. 
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Jacobian of the transformation, and it will be equal to the determinant of the transforma- 
tion: 



f dx^ Adx^ A ■ ■ ■ A dx"" 



f ^dy'^ A ^df' A ■ ■ ■ A ^dy'- 
fdet (^^dy'Ady^A---Ady'^ 



For example, consider the three-form in M? (with / = 1) 

dx Ady A dz 

We can transform this to spherical coordinates according to 

X = r cos 9 sin 
y = r sin 6 sin 

z = r cos 6 



(2.3.81) 



(2.3.82) 



(2.3.83) 



The transformation matrix will be 



dx^ 
dy' 



1 dx 

dr 


dx 


dx\ 

d<f> \ 




de 




dy 


dy 


dy 




dr 


de 


d<l> 




dz 


dz 


dz 1 




\dr 


de 





^cos Q sin ( 
sin Q sin ( 

cos <i) 



— r sin Q sin i 
r cos Q sin 




r cos Q cos ( 
r sin Q cos ( 
— r sin I 



(2.3.84) 



And the Jacobian (determinant) of this matrix is 



sin I 



So 



dx Ady Adz = sin (f) dr AdO A 



(2.3.85) 



(2.3.86) 



This makes perfect sense from standard vector calculus - we can recognize both sides as 
the volume element in their respective coordinate systems. 

Now, recall that with the Mobius Strip above, the problem was that when you go 
around the strip the coordinate system changes orientation - in other words dx A dy — ^ 
—dx A dy. So, if we choose a point on the strip, it will be in the intersection of two different 
coordinate neighborhoods. And while the two-forms in each coordinate neighborhood 
will be proportional to each other, they will differ in sign. According to our above result 



(2.3.81 ), this is equivalent to saying that the Jacobian (the determinant of the transforma- 
tion) is negative. 

Therefore we have the following definition - A manifold A4. is orientable if, for any 
overlapping coordinate neighborhoods Ui and Uj, the Jacobian of the transformation be- 
tween them, denoted J, satisfies J > Oj^ 



^^It obviously makes no sense to say that it must satisfy J > because if J = then the transformation 
is singular and it cannot be inverted. 
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As a final comment about orientability, notice that if M. is orientable, then according 
to what we have said above there must exist an n-form 0„ which never vanishes. This 



form is called the Volume Form, and we denote it 0y. Both sides of (2.3.861 are volume 
forms in their respective coordinates. A volume form will play the role of the integration 
measure when we integrate over ^A. Any oriented manifold will admit two different 
equivalent classes of volume forms - one with a relative positive sign and another with a 
relative minus sign. We call one of them right handed and the other left handed. From 
now on, unless otherwise specified, you can assume that all manifolds we work with are 
oriented. 

We now turn to the idea of integrating a form over a manifold. Let's say we have some 
function F such that 

F :M — >R (2.3.87) 

(F is just a real function on the manifold). We will, for now, only worry about integrating 
over a single coordinate neighborhood Ui Ai. To integrate F over Ui we first multiply it 
by the volume form (py which acts as the integration measure for the integral. The volume 
form will usually be some function h (In three dimensions it was /i = 1 in Cartesian 
coordinates and /i = sin in spherical coordinates, cf ( 2.3.86[ >) times the wedge product 
of all basis covectors in some coordinates frame: 

(j)v = hdx^ Adx^ A---A dx" (2.3.88) 

So, the integral is written as 

/ F{p)^v (2.3.89) 

(where p E Ui). While the integral will only be well-defined on the entire manifold if 
it is oriented, we can take a single coordinate neighborhood to be oriented because it is 
homeomorphic to M" - if is not oriented, then the orientation, or handedness, of (pv 
will not be well defined in some region, and the inte gral ca nnot be globally defined. 



Once we have a function F and a volume form (2.3.881, with the frame covectors rfx* 
in a particular order, we define the integral as 

I F{p)(t)v= I F(0-^'«(a;\x2,--- (2.3.90) 

JUi J<p('1{U,) 

Where the integral on the right hand side is understood as a normal integral as you 
learned about in an introductory Calculus course. Notice that the integral on the left 
hand side "lives" on Ui. F is a function directly from A4 to M, and the volume form (py 
lives on the manifold. However the right hand side "lives" on the open subset of M" that 
the coordinate functions map to. Because <p^^^ are homeomorphic they are invertible, and 
we can take any point in M" and map it back to the manifold using (f)^^'^^\ 

So the point is that an integral on the manifold is actually done on the open subset of 
that the coordinate functions map to. This isn't particularly profound - the manifold 
is an abstract, coordinate free space. An integral needs more structure to make sense. 
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Therefore it only makes sense that we take the integral down to a space where we are 
used to doing integrals. 

Also notice that the integration measure dx^dx^ ■ ■ ■ dx"' is ordered the same as the vol- 



ume form (2.3.88). To understand why this is important, recall that a particular wedge 



prod uct is a unit vector in a particular vector space of the exterior algebra (see section 
2.2.1 1. And just as i and — z represent two different (opposite) directions in M'^, so also 
dx A dy and dy Adx {= —dx A dy) represent two different (opposite) directions in the vector 



space A^TpM^. The discussion around equations (2.3.75) through (2.3.79) may help clarify 
this. So, the volume form on the left side of (2.3.90 ) defines the orientation of the coordinate 
system being integrated over. Therefore if we were to switch two of the frame covectors, 
the volume form would be 



-h dx^ A dx^ A ■ ■ ■ A dx"- 



And the integral would be 



F(0 



, x"')hdx^dx'^ ■ ■ ■ dx 



(2.3.91) 



(2.3.92) 



where the integral on the right hand side is understood as being over a coordinate system 
with the opposite orientation. This is a generalization of the fact that 



f{x)dx 



f{x)dx 



(2.3.93) 



from introductory calculus. 

Moving on, it may appear that ( 2.3. 90[ ) depends on the coordinates chosen. However 
it does not. Let's say we have a smooth transformation from coordinate neighborhood 
Ui to Uj with coordinates y, and for simplicity f/,; and Uj cover the same portion of the 



manifold. The volume form will transform according to (2.3.81 ), and therefore in the new 
coordinates the integral will be 



/ 

JUi 



F{p)cf> 



V 



, 2;^ . . . , dx^ dx^ ■ ■ ■ dx"" 
F{<j,-'^^^\y\y\...,y")hdei ( dyHy^ ■ ■ ■ dy^ (2.3.94) 



dy^ 



where, because of the smoothness of the transition functions between coordinate's (the 
i/? '•*■'•'), we could also have expressed F in the new coordinates as 



F{x\y\ y\..., y"), y\ . . . ,y"), . . . , x"{y\ y\ . . . , y")) 



(2.3.95) 



So, in ( 2.3.94[ ) we find the genera l equation for a change of variables in an integral. 
You can see this clearly from (2.3.86) and (2.3.54). In (2.3.54) we had the volume formp^ 



^^Volume in two dimensions is the same as area - area is a two-dimensional volume. 
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dx A dy. If we wanted to integrate a function / over a coordinate neighborhood, it would 
have the form 



f dx A dy = / f{x,y)dxdy (2.3.96) 



The right side of (2.3.54) then corresponds to the integral 

/ frdrAd9= ! f{r,9)rdrd9 (2.3.97) 
This corresponds exactly to what is done in an introductory vector calculus course. You 



can do the same thing for (2.3.86 1. 



So far all we have discussed in this section is the special case in which we are only in- 
tegrating over a single coordinate neighborhood Ui. Now we can discuss how to perform 
an integral over multiple coordinate neighborhoods. 

Take some atlas {{Ui, <p^^^)} such that each p e M is covered by a finite number of 
coordinate neighborhoods (we assume this is always the case). Then define a set of dif- 
ferentiable functions ei{p) satisfying the following: 

1) e,(p) =Oifp^f/,. 

2) < tiip) < 1 Vpandi. 

3) J:^e^ip) = lypeM. 

Such a set of functions is called a Partition of Unity. 

The integral of a function F across the entire manifold can then be written as 

/ F<Pv = J2 f '^iP)f(P)^v (2.3.98) 



It can be shown that the integral defined by (2.3.981 is independent of the choice of co- 
ordinate functions on any of the neighborhoods, and is independent of the partition of 
unity. 

For example, consider the manifold and the function F(9) = cos^ 9. Of course we 
can do the integral in the normal way: 



2n 

2 



d9 cos' 9 = 71 (2.3.99) 







But we can reproduce this using a partition of unity. Let 

ei(^) = sin2(^/2) 

t2{9) = cos^(0/2) (2.3.100) 

Then the integral is 

/ d9cos^9 = / d9sm'^i9/2)cos^9+ / d9 cos\9/ 2) cos^ 9 

JS^ Jo J-TT 

= 1^+1^ = -^ (2.3.101) 
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So, on any orientable manifold, a function can be integrated across the manifold by 
mapping the entire integral into a coordinate system. The volume form on the manifold 
determines the orientation of the integral, and the integral is independent of the coordi- 
nate functions chosen - the Jacobian allows us to transform the integral from one coordi- 
nate system to another. If we must integrate across multiple coordinate neighborhoods, a 
partition of unity allows this to be done in a well defined way, and the integral does not 
depend on the value of the partition of unity. 

2.3.9 Vector Fields and Lie Derivatives 



We have discussed tangent spaces at every point p & Ai. However it will also prove 
useful to discuss vector fields on manifolds. A vector field is a smooth map from Ai to 
the tangent bundle TAi. We define a smooth map ^ from Ai (or a subset of Ai) into TAi, 
called a vector field, as 

P ^ (P, vb)) = ap) e T,M (2.3.102) 

In other words, at each point p, v selects one single vector out of the which com- 
prises the tangent space. We then demand that this map be smooth in the above sense. 

An example of this is a fluid moving through some volume. The manifold will be the 
volume the fluid is moving through. Any single drop of fluid could be moving in any 
direction in the tangent space (there are n degrees of freedom in an M" tangent space). 
However, a particular vector field, or in other words a particular "flow" of the fluid, will 
assign a unique velocity at each point in the manifold. So, for each point p, we can assign 
a velocity v{p). 

Another example familiar in physics is an electric field E. The manifold in this case is 
usually just space, and the vector field assigns a specific "magnitude and direction" to the 
electric field at each point. 



In section 2.3.4 we had a fairly in-depth discussion of the relationship between a curve 
Q{t) E Ai 3 Q : [a,b] — > Ai and a tangent vector at some point on that curve. You are 
encouraged to reread that section now and familiarize yourself with those ideas. 

There, we had a single curve defining tangent vectors at any point tq along the curve. 
We also mentioned there that any vector can be defined by a curve passing through that 
point. We will now exploit this one to one correspondence between curves and tangent 
vectors. 

Let us now define, instead of a single curve Q E Ai, dL family of curves, which we call 
q{T,p) E Ai, where r E [a, b] and p E Ai, and r plays the exact same role as before. We 
will define this so that E [a, b], and so that ^(0,^) = p. In other words, this is a family 
of curves defined for every point p E Ai, parameterized such that the r = point is 
the identity map. Then the point q{r',p) will be another point on Ai further along the 



curve which passes through p. Using the same type of argument as in section 2.3.4 this 
construction will define a tangent vector at every point of Ai (by taking the r derivative 
of g(r, p) at the point p), and consequently we have a vector field on Ai defined by our 
family of curves. 

One obvious necessary condition is that for every point p E Ai, there can be one and 
only one curve passing through p - no two curves can intersect. If there is an intersection 
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then there will be two different tangent vectors associated with that point, which we don't 
want. 

So, given a family of curves with no intersections, we have a well-defined vector field 
- a single tangent vector at each point in ^A. We can, however, approach this in the 
opposite "direction". Let's say we have a vector field already defined on ^A. This implies 
that there exists a family of curves g(r, p) which define this vector field. In other words, 
the r derivative of g(r, p) at a given point p will define the tangent vector at that point. 
We will use the coordinate functions x as in 2.3.4[ resulting in the mapping 

q(r, xb)) = x(g(r, p)) : [a, b] — > (2.3.103) 

If we define a tangent vector at p to have components f *(x(j))) (so the vector in the vector 
field at each point is v(x(j))) = i''(x(]:>))i^)F^then we have 

"^^^^""^^^^^^ = v\qirMp))) = v\mr,p))) (2.3.104) 

or 

rfq(r,x(2))) 



dr 



v(q(r, ^{p))) = v(x(g(r, p))) (2.3.105) 



(compare this with ( |2.3.20 >). Notice that this is a first order differential equation for q 
subject to the boundary condition 

q{OMp))=^ip) (2.3.106) 

In other words, the vector field v^{x{p)) on has produced a differential equation, the 
solutions of which will be the curves which "flow" along the manifold and define the 
vector field. 

As an example, consider the vector field In Ai = M."^ given by 

d d 

v = -?/7r + ^7r (2.3.107) 
ox ay 

So, we have = —y and v'^ = x. The differential equations for this will be 

dq^ 2 

~T~ = -y = -Q 

dr 

^ = X =q^ (2.3.108) 

dr 

Taking the second derivative of the first with r and plugging in the second, we get 

d'^q^ dq^ 



dr"^ dr 



-q^ (2.3.109) 



'^''We are letting the components depend on the coordinates of p rather than on p for simplicity - it will 
be easier to keep everything in the same space, namely M". 
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which has solutions of sin's and cos's: 



A sin T + B cos r 
C sin T + D cos r 



(2.3.110) 



We will make our boundary condition at r 
the conditions (cf ( |2.3.108D ): 



dr 



T = 



T=0 



T=0 



It=0 



the point (xq, ?/o)- We therefore have 



-yo 

Xq 

yo 



(2.3.111) 



This leads to the general solution: 



q\r) 
q\r) 



—Uq sin T + Xq cos r 

Xq sin T + Uq COS T 



(2.3.112) 



This is simply the parametric equation of a circle in M?, starting at (xq, yo) and centered at 



the origin (in other words, a circle around the origin with radius r 
family of curves will be the set of such circles with all radii. 



Notice in (2.3.112 1 that we can rewrite it as 



COST 

sin r 



— sm r 
cos r 



Xq + Uq). The total 



(2.3.113) 



The matrix on the right hand side is an element of the two dimensional representation of 
the Lie Group 5*0(2) (cf [40]). So the vector field \{p) induces curves which take a point 
in M. and act on it like 5*0(2) - rotating it around the origin while preserving the radius. 

So, a family of curves over the entire manifold M. is equivalent to a smooth vector 
field over the entire manifold M.. We refer to the family of curves as the flow associated 
with the vector field v(x(]9)). 

Notice that, in general, a flow q{T,p) induces a Lie Group. For any point p E M, we 
have 

1) Closure - g(r, qir' ,p)) = q{T + t',p). 

2) Associativity - obvious. 

3) Identity - q{0,p) = p 

4) Inverse - g(— r, q{T,p)) = q{T — T,p) = q{0,p) = p 

Furthermore, this group (for a single point) is an Abelian Lie Group. It will look locally 
like an additive group on M, but not necessarily globally. In the above example it was 
5*0(2), which looks locally like M, but the identification of the r and the t + 2n element 
shows that it is not globally like M. 

Now consider an infinitesimal perturbation away from p using the path q (with in- 
duced vector field v(x(j))), where p — > q{e,p) (where e is tnftnitesimally small). We want 
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to do a Taylor expansion around r = 0, so we use the coordinate functions x to map to M, 
so now x{p) — > q(e, x(j))). Now, doing the Taylor expansion. 



= x(;j) + ev(x(p)) + ■ • • 



+ 



r=0 



(2.3.114) 



(where we have only kept terms to second order in e). So we see that, geometrically, the 
vector V acts as a sort of "generator" at x(]9), pointing in the direction of the flow. 

Furthermore, consider now a finite (rather than infinitesimal) transformation from 

x(j)): 



q(f^,x(]9)) 



E- 



1 g(iq(r,x(p)) 



n=0 

d 



d 

n\ \ dr 



r=0 



dr 



+ 



T=0 



e'^d-q(r,x(2)))|^^^ 
e""x(]o) 



(2.3.115) 



To get the last equality, we have used the general equation that, for a vector v and operator 
V with eigenvalue V (so V\ = V\), we have 



V 

e V 



V 

e V 



(2.3.116) 



Operating on q(r, x(2))) with ^ gives the tangent vector v, with only the starting point 
x(2)) left. 



So the content of (2.3.115 1 is that if we start at an arbitrary point x(j)) on the manifold 
(mapped to its coordinates), the action of the class of curves g(r, p) will move that point 
according to the exponentiation of the vector ^{p) at the point p. In other words, the 
vector \{p) is the infinitesimal generator of the flow at the point p. 

This should remind you of what we did in [40J with the generators of Lie groups. It 
turns out that this is in fact the exact same thing. The only difference is that we are ap- 
proaching this geometrically rather than our purely algebraic approach in the first paper, 
and that our discussion right now is about general manifolds. 

Moving on, it is obvious that we have the following properties, which show the corre- 
spondence between using q and using the exponentiated vectors: 



1) q(0,x(;>)) = i^{p) 

3) ^iJ,^.{.r',p)) =q(r + r 



,0v 



d , 



,P) 



Ap) 



ve x(j)) = v(x(j))) 



,(r+r')v 



So we can see that these are indeed equivalent descriptions of the relationship between 
vector fields and flows. 

There is something interesting going on here - if we are at a point p E A4. and we want 
to move along the "flow" generated by a vector field, we can follow the flow line through 
p as far as we want knowing only the vector at p. We don't need to know the vector at 
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any other point except where we start. As we mentioned above, this was the same situa- 
tion with the generators of a Lie group in [|40|. The generators described the behavior of 
the group near the starting point (the identity), and exponentiating the generators (along 
with some parameter) moved you anywhere in the group. Here, the tangent vector at p 
describes the behavior of the vector field near the starting point (the point p), and expo- 
nentiating the tangent vector at p (along with some parameter) moves you anywhere in 
the manifold along the flow line through p. We have arrived at extremely similar ideas, 
first through purely algebraic means and now through purely geometric means. The dif- 
ference is that before the generators satisfied a specific algebra (defined by the structure 
constants), and there were multiple generating tangent vectors at each point. 

We will continue to explore the similarities between these two situations in this sec- 
tion, and in the next section we will see how we make the jump to a true Lie group 
structure, thus tying what we did in the group theory section of [|40| with what we have 
done so far here. 

Moving on, let's say we have some manifold A4 with two different vector fields, v^^^ = 
f and v^^) = v^^^''^-£^, which are not necessarily related to each other in any way. 

These two vector fields will naturally have associated families of curves, which we 



denote q^^^ (r, p) and g*-^^ (r, p), respectively. These will naturally satisfy (2.3.105 1 separately 
(mapping everything once again with the coordinate functions): 



dq(^)(r,x(p)) 



v(^)(q(^)(r,x(j,))) 



dr 

dT 



(2.3.117) 



So at the point p, there will be two different tangent vectors and two different flow lines. 
Let's take the v^^^ tangent vector and the q*^^^ flow line. Our goal here is to calculate 
the derivative of v*^^)(q'^^)(r, x(j)))) along the curve q^^^ (so we are, for this problem, not 
immediately interested in the vector field v^^^ or the curves q*^^)). 

As in calculus I, we take any derivative of v*^^) at point p by comparing it with v^^^ at 
a nearby point q^^\e,p) and then taking the limit as e ^ 0: 

v(^)(q(^)(6,x(p)))-v(^)(x(p)) ^ 



Equation (2.3.118 1 makes sense in principle, but it is not well defined. There is an ambigu- 
ity in how to compare these two vectors. The problem is that v(^^(x(j))) is in the tangent 
space TpM, while v(^)(q(^)(e, x(p))) is in the tangent space Tg(i)(^ p)A^, and there is no well 
defined way of comparing these two vectors. 

This problem is more obvious on curved manifolds, where there is no natural way at 
all compare vectors in two tangent places. 

The solution is to use the tangent map (cf section 2.3.4[ >. The problem is that the vectors 



aren't in the same tangent space, and so we want to move one of them to the tangent 
space of the other. We have a smooth function q^^^ which maps p to q^^\e,p) (or the map 
q^^\—e, p') which will map (f^^ (e, p) back to p), and therefore we can use what we learned 
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in section 2.3.4 to make (2.3.1181 make sense - by comparing the two vectors in the same 



tangent space, TpM . 

We make a warning here: the notation we will use in the next few pages will be 
hideous. The meaning of the notation should be very clear at each point, but because 
of how many terms we will be writing in each expression it will appear very muddled. 
Bear with us and trust that by the end everything will simplify very nicely. 

As we said above, the vector v(^)(q(^)(e, x(j>))) is at the point q'^^\e,p) G M. We 
want to take the vector at this point back to the point p. We do this by using the 
tangent map of q^^\—e,p), which we denote T^(i)(e,p)Q''^K~'^^ Q^^K'^^P))- So, acting on 
the vector \-'^^\q'^^\e,:x.{p))) with the tangent map T^w^^^^p^q^^^—e, q^^\e,p)) will map 
^-^^(e, x(j)))) back to a vector in the tangent space TpM. And therefore we can write 



the definition of the derivative of the v*^^^ vector field along the q^^^ curve as 



/:,(.)v(^)(x(;,)) 



lim 



(2.3.119) 



Admittedly this is a bit of a monstrosity, but it can greatly simplified as we will see. All 
it means is that we are comparing elements of a vector field at p and at q^^\e,p), taking 
the difference and dividing by the "distance" between them (e). We are using the tangent 
map in order to make the comparison well defi ned. 

The derivative C^{i)\-'^^\-x.{p)), defined by (2.3.1191 is called the Lie Derivative of 



v(2)(^x(j))) at the point p in the direction defined by the vector field v*^^-*. Note that the 
subscript is the vector field which defines the curves along which we take the derivative. 

We now begin to simplify ( 2.3.119[ > by explicitly writing out the tangent map. Because 
we are working with only an infinitesimal displacement e, we only need to keep terms to 
first order, as in (2.3.114). First, using ( 2.3.114[ > we write out the vector 



v(^)(q(^)(e,x(2)))) = v(^)(x(2)) + ev(^)(x(p))) 



v(^)(x(p)) + ev^^^'*(x(p)) 



gv (^)(x(j^)) 



+ 



(2.3.120) 



Keep in mind that this is a vector in the tangent space Tg(i)(^ ^^A^. So we want to map this 
vector back to TpM using the tangent mapping Tg(i)(^ p)g(^)(— e, q''^\e,p)). 

So, using (2.3.43 > , wher e ( 2.3.120[ > plays the role of the original components (which 
were denoted in (2.3.43)) and~q^(e, x(2))) plays the role of the transformation func- 



tion (what was denoted in (2.3.431), we have (using the same coordinate functions 
and therefore the same basis vectors) that the tangent map gives the the vector with k^'^ 
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component (compare this closely to ( 2.3.43[ >): 



,(2) J, 



x(p)) + e^;(^)'«(x(p)) 



dx 



{x''{p)-ev^'^'\^{p))) 



dx' 

P^HAp))6^ - ev^'^H^{p) f'''2f'^^ 



dv^^^'^'i^ip)) 



dx^ 



+ eS'v^'^^\^ip)) 



5^(2) J (x(p)) 



dx' 



(2.3.121) 



where we have suppressed the arguments for notational clarity and used (2.3.114) to get 
the second line. Rewriting this as a vector equation (instead of merely components), we 
have 



dx^ dx^ 



(2.3.122) 



Now, plugging this into (2.3.119 1, we have 



dx^ 



+ V 



dx^ 



M) 



dx^ ) 



lim -e ( — i;(2)'-' — — ; — |- V 
Sa;-' dx^ 



(2.3.123) 



Now we can use the expansion of the vectors v^^^ and v*^^) in terms of the frame 



on M., where v^^^ = v^^^'^-^ and v^^^ = v^^^'^^, to write this as (rearranging the indices 
slightly) 



£,(.)v(^)(x(p)) 



d 



d 



-V' " —V 

dx 



dx^ 



dx^ 



- V 



,(2),i 



d_ 
dx' 



dx^ 



+ V 



A 

dx^ 



,(2),i 



where the brackets in the last line indicate a commutator. 



(2.3.124) 



So, equation (2.3.124) has given us a drastically simpler form than (2.3.119) for the Lie 
derivative. 

So once again, the meaning of £^(1)^^^^ is the derivative of v*^^) in the direction of v*^^) 
at a given point p e M. 
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To get a feel for the geometrical meaning of the Lie derivative, consider the following 
situation. Starting at some point p G (as usual) consider once again two vector fields 
v*^-*^) and v^^) with respective families of curves q'^^^ and cp'^ Working with the coordinate 
maps once again, imagine starting from x(j)) and going a small distance e along (f^^: 



xb) ^ q(^)(e,x(2))) 

and then from q''^-*(e, x(j))) going a small distance 5 along q^^^^: 

q(^)(e,xb))— .q(^)(5,q(^)(6,xb))) 



(2.3.125) 



(2.3.126) 



Then, using (2.3.114), 

q(^)(5,q(^)(e,x(p))) 



q(^)('^,x(2)) + ev(^)(x(2)))) 

x(p) + ev(^)(x(;^)) + 5(v(^)(x(;^) + ev(^)(x(p)))) 
x(f)) + ev(^)(x(;j)) + (5v(^)(x(p)) + 5tv^^^^\-K{p)) 



9v(^)(x(2))) 



x + ev^'^ + ^v^^-" + 5tv 



(2.3.127) 



Next, consider taking the paths in the opposite order - start with a small distance 5 along 
q(^) and and then a small distance e along q^^^: 



x(p) q(^)(5,x(;,)) ^ q(^)(e,q(<5,x(p))) 

This will result in 

q^'\e, ci^'\5,^ip))) = ■ ■ ■ = X + ev(^) + ^v^^^ + 5ev^'^' 



dx^ 



(2.3.128) 



(2.3.129) 



Now we want to know if these two vectors are different. In other words if we move 
them along these two paths (g*-^^(e), q'''^\6) and q'^^^e)), how will they compare to 

each other? This will be given by simply taking the difference: 



q(^)(5,q(^^(e,x(;,))) - q(^)(e, q(^^(5, x(;,))) 



dx' 
dx^ 



(2.3.130) 
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,(1) 



q(l)( £ (p)) 



.(2) 



i(3) 



So the Lie derivative is a measure of how much two paths fail to commute. This informa- 
tion will tell us very important information about the nature of the vector fields on A4. 
We will explore this meaning later in this paper. 

Before moving on, notice that in considering vector fields on manifolds, we found in 



(2.3.115 ) that the global behavior of the flow generated by a vector field is generated by the 



element of the tangent vector at the starting point through exponentiation. And now we 
have seen that, via the Lie derivative, the commutator of the tangent vectors (which act 
as generators) provides information about the structure of the vector fields on Ai. Once 
again, this should remind you of what we did with Lie groups in [40]. We had a "param- 
eter space" where the behavior near the identity element is described by the generators, 
and we can see the global behavior through exponentiation of the tangent vectors at the 
starting point. We have also seen that the general structure of the relationship between 
the vector fields is given by the commutation relation between them. Once again, this is 
identical to the case with the elements of a Lie algebra. 

From a single point p on a manifold TM, some set of vector fields will each define a 
specific tangent vector at p. If there are n such tangent vectors, we can move along n 
different curves in Ai. What we will see is that through composition of the curves (like 



(2.3.126)) we can move to any point on an n-dimensional subspace of If n is the 
dimension of Ai, then we can move to any point in Ai through compositions of curves. 

The exact nature of how these tangent vectors, which act as generators, relate to each 
other is then given by the commutation structure of the vectors. For example consider 
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starting at the point p and first moving e along v*^^) and then moving 5 along v*^^) using 

(2.3.131) 



equation (2.3.115): 



This can be depicted similarly as in the picture above. 

Of course this operation should be equivalent to moving directly towards the new 
point, rather than moving along two different curves separately. 




So what is the relationship between the single tangent vector at p and the distance in 
terms of these two tangent vector (v*^^^ and v^^^) and distances (e and 5)1 By following 
the same calculation we did in [|40| to derive the Baker-Hausdorff-Campbell formula we 
arrive at 

= e'^"*'^+^"*'^-^^^^v(^)-'''x(p) (2.3.132) 

We can see the geometric meaning of this easily. Consider again the two paths pictured 
above. One ambiguity is that the expression e'^"^^^'e^'^*^'x(j))) involves taking the tangent 
vector v*^^) first atp, and then the tangent vector v^^^^ at the point (f^\e^p). However the 



79 



expression e^^^*'"''''^^ '~ 2^'=^v(2)'^''^''x(]9) involves both vectors being evaluated at p. Obvi- 
ously, because the curves may fail to commute (as measured by the Lie derivative), sim- 
ply adding them together at p may be wrong - and the amount it is off would be expected 
to be proportional to the Lie derivative. So, £^(2) v^^) provides the vector correction factor 
to compensate for the changes in the vector field evaluated at different points. 

Comparing what we have seen so far in this section to the definition of a Lie algebra 
in section 1.1. 5[ you can see that what we have here is indeed the same. 

So, through purely geometrical reasoning, we can once again see the same type of 
structure as we saw through purely algebraic reasoning. As we proceed this structure 
will become more and more apparent. 

Before moving on, however, we make one final comment. We just considered how to 
use the tangent mapping to evaluate the Lie derivative of a vector along a curve. We now 
consider briefly how to find the Lie derivative of a form. 

Given some covector field u;{p) = uji{p)dx^ and a vector field ^{p) = v\p)-^ gen- 
erated by the family of curves q{T,p), the derivation of the Lie derivative of uj{p) along 
the path g(r, p) is derived in almost the exact same way as the Lie derivative of a vector 
field. The biggest difference is that we obviously can't use the tangent mapping (which 
maps vectors) to map the form ( j)) at g(e, p) back to the cotangent space at p. Instead the 
obvious choice is to use the pullback g*(e, p). 

Doing this and repeating what we did above, we find that the Lie derivative of(jj{p) is 

CMp) = v^Trid^' + ^^J^dx' (2.3.133) 

Notice there is no minus sign in this expresion. The reason for this is that the tangent 
mapping maps in the same direction as the map which induces it, whereas the pullback 
operates in the opposite direction. So to map a vector from e back to p, we must go in the 
— e direction, hence the minus sign. The pullback on the other hand goes in the opposite 
direction, so to move from e back to p we start with the map which goes from p to e and 
therefore uses the +e direction, hence the lack of a minus sign. 
Finally, given an arbitrary tensor of rank (g, p), 

T{p) = Tl''-f{p)— ® ■ ■ • ® — ® dx*^ A ■ • ■ A dx'^ (2.3.134) 

we can take the Lie derivative along the curve defined by the vector field ^{p) = v 'ip)-^ 
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as 



■ dTllCt d d ■ 

C^T{p) = I: . ®---®^® dx'^ A ■ ■ ■ A dx'^ 

ax* ox^^ ox^i 

■ f d \ d 
+TllCi'' C^^^ ® • • • ® -TT— ® dx'^ A ■ ■ ■ A dx'^ 

+ ■■■ 

+Ti!i::'ti^ ® ■ ■ ■ ® \c^^] ® dx'^ A ■ ■ ■ A dx'^ 



i' ® • • ■ ® 7^ ® jC^dx'' A ■ ■ ■ A dx'^ 



d d 

+ ■■■ 

+Tt"'i'^ ® ■ ■ ■ ® 7^ ® dx'' A ■ ■ ■ A ( C^dx''^ 



(2.3.135) 



We now have the tools to consider Lie groups in fuller detail. But first, in order to 
(hopefully) provide greater insight into the meaning of the Lie derivative, we will consider 
another perspective on it. 

2.3.10 Another Perspective on the Lie Derivative 

Consider the standard basis for M", for i = 1, . . . ,n. By definition, if you move in 
the Sj direction, you aren't moving at all in the (for i =^ j) direction. This is the point of 
the standard basis. For example, in M^, moving in the direction involves no motion at 
all in the e^, direction, or vice-versa. Or in polar coordinates, moving in the direction 
no motion at all in the direction, or vice-versa. 

We can reformulate this in the following way. Consider two vector fields in 



^r^^\x,y) = v^^^'^{x,y)e^ + v^^^^^ {x, y)ey (2.3.136) 



v^'>ix,y) = v^'>'''{x,y)e^ + v^'>^y{x,y)ey 

If we take the special case v^^^'^ = v^'^^'^ = 1 and v^^^'^ = v^"^^'^ = 0, we have 

v'^^\x,y) = 

v^'\x,y) = ey (2.3.137) 

Now consider some arbitrary point p ^ xq = {xo,yo) € M^. Repeating what we did in 
equations (2.3.127) and ( 2.3.129[ >. First we move from a small displacement e in the 
v*^^) = direction, then a small displacement 5 in the v^^^ = ey direction: 

^ x;=xo + ev(^) + 5v(^) + e5t;(^)'^^ 

ox^ 

= x, + ev(^) + 5v(^) + e(5t;(^)'»^e, 

dx' ^ 



+ ev(') + (5v(^) (2.3.138) 
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Then, if we do this in the opposite order, we get 
Consequently we can easily see 



^0 ^0 



and therefore (cf equation (2.3.130)) 



(2.3.139) 



(2.3.140) 



(2.3.141) 



Notice that the origin of this Lie derivative vanishing^ is in equation (2.3.1381 - namely 
the vanishing of 



(2.3.143) 



(and the analogous term in equation (2.3.1391). In other words, the vanishing of the Lie 
derivative is a consequence of the fact that moving v^^^ = e^; in the v^^^ = e^^ direction 
doesn't change it, and vice versa. Neither field changes when dragged along the other. 
This is consistent with what we know of the Lie derivative - when you move v*^^) in the 
v^^) direction it doesn't c hange, a nd therefore the derivative is zero. 
Now consider letting (2.3.136 1 be defined b}fj 



X 



= 6^ + 



a/x^ + 1/^ \J x'^ \ y'" 



X 



d 



+ 



y 



d 



x,y) 



V^x^ + ?/2 dx + y^ dy 

y 



'x^ + y^ 

y 



\/x^ + y^ 



d 



+ 



X 



d 



a/x^ + ?/2 dx a/x2 + y2 dy 



(2.3.144) 



We will calculate the Lie derivative directly from (2.3.1241 first: 

d 



X 



+ 



y 



d 



y 



d 



+ 



X 



d 



a/x^ + y2 dx a/x2 + J/2 dy' + y2 dx y/x2 + J/2 dy 

yd X d 



rj,2 _|_ y1 Qj. J.2 _|_ y2 Qy 



(2.3.145) 



■^^Of course we could have arrived at < 2.3.14l| directly from the definition of a Lie derivative. Using 



v^'^ - ^andv( 



~ dy' 



(2) 



d_ d_ 
dx ' dy 



(2.3.142) 



^^Notice that these are simply the and basis vectors (polar coordinates) written in terms of Cartesian 
coordinates. 



82 



Then, again using (2.3.1271 and (2.3.1291, 



^0 
Xo 



(^) + 5v(^) + e<5(0) 



ev 



(2.3.146) 



and 



xo 



X. + 5v(^)+ev(^) + e5^- 



So 



_|_ ^2 ^ ^2 _^ y2 y 



(xo + ev(^) + 5v(2)) - (xo + 5v(^) + ev(^)) 
2/ 



(2.3.147) 
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x^ + y 



2^y 



(2.3.148) 



which comparison with (2.3.130 1 gives the same Lie derivative as in (2.3.145 1. 

So, the meaning of the Lie derivative of one field in the direction of another is that it 
tells us how much one field changes as you move along the other. 

2.3.11 Lie Groups and Lie Algebras on Manifolds 

We begin this section by simply redefining a Lie group. We gave significant detail as 
to what Lie groups are in [40 J from a purely algebraic perspective. Now we reformulate 
them using the geometry we have built up so far. 

Simply put, a Lie group is a special type of differentiable manifold which we denote 
Q. What makes it special is that it it has a group structure (see ||40|| for more details about 
groups). This means that we can think of every point g G Q as an element of a group. 
Clearly there must exist some well-defined group multiplication law such thatj^ 

1) 9i,9j => i9i9j) e g. 

2) {9i9j)9k = 9t{9j9k)- 

3) 3e e g 3 ge = eg = gVg e Q. 



4)Wgeg3g-' eg3gg 



-1 



9 ^9 



e. 



As a simple example consider the manifold S^. It is not by itself a Lie group. However 
we can give its coordinate 9 an additive structure such that 



e,e' eg ^ e + e' (mod 27r) e g 



(2.3.149) 



*^We aren't using any specific notation for the multiplication, such as gi ★ gj or • gj, and instead are 
simply denote the product of two element gi and gj as gigj . This won't result in any ambiguity at any point 
in these notes. 
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With this additional structure (the relationship between the coordinates, namely addi- 
tion), we have turned into a Lie group. Specifically, it is a representation of S0{2). 

Notice that this is very similar to the definition we gave in [40 J. There we defined 
a Lie group as a group where the elements depend on a set of smooth parameters. This 
definition is not really different from the previous paper - it just now carries with it the 
additional "baggage" of what we know about manifolds. And just as in 1401 we proceeded 
from that definition to study Lie groups algebraically, we now proceed to study them 
geometrically. 

Let's begin with an arbitrary point ing G Q. Keep in mind that this is both an element 
of the group and a point on the manifold. We can act on g with any other point in h G G, 
resulting in the point hg E G (we are taking multiplication in this case to be from the left - 
we could just as easily do all of this by acting from the right, and we would get the same 
results; we choose left multiplication for concreteness). 

Another way of thinking about this is that, for any element h E G, we can define a 
map 

fk-.G^G 

such that 

fh{g) = hg 

Clearly there will be an infinite number of such maps - one for each h E G- 



Naturally this will induce a tangent map Tgfh (cf section 2.3.41 and a pullback (cf 



section 2.3.6). 



One of the primary differences between a Lie group, or a Lie manifold, and a normal 
manifold is that on a normal manifold there is no way of singling out particular vector 
fields. On a Lie group/ manifold there are certain vector fields which are "special". 



Specifically, let \r{g) be a vector field on G- Just as in section 2.3.4 the map fh will 
induce a tangent map Tgfh which takes a tangent vector at g to a tangent vector at hg. 
This allow us to define an invariant vector field as one which satisfies 

T,A(v(^)) =v(%) (2.3.150) 

As an illustration of this, consider the (simple) example of with an additive 50(2) 
structure as described above. We define the map 

U-.e^ip + e (2.3.151) 

So, given an arbitrary vectoij^u^ J^, the tangent map will be (using ( 2.3.43[ >) 



vector is a vector with magnitude v'^ in the direction. 
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This makes sense. The tangent mapping simply rotates this vector around the circle with- 
out changing the magnitude. S o, consid er the vector field v(6') = v^{9)-^ = v-^, where v 



is a constant. The tangent map (2.3.1521 will map this to f J| at the point + 6. Also, we 
know that v(^/' + 6')=v(^^)by the definition of the vector field we just gave. So, we have 

=^ r,/^(v(^)) =v(V^ + ^) (2.3.153) 



Which is simply ( 2.3.150[ > for this example. Therefore, v(^) is in this case an invariant 
vector field. 

On the other hand consider the vectoipj field v'(6') = cos(^)^. Now, using the same 

=^ TgUiV{e))y^y^'{^ + e) (2.3.154) 

and so v'(6') is not an invariant vector field. 

So, for any point g E G, we can choose any tangent vector v G TgQ, and v will define 
a unique invariant vector field v (we suppress the argument (g) because we are thinking 
of the entire field here, not the field at a certain point) on Q. If v is defined at g, then the 
vector field v is defined by 

^ = {TJ,{^)\\fh e g} (2.3.155) 



45' 



The prime here is not a derivative - it is just a label. 
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In other words, the vector field at the point a = hg & Q is 



v(a) = r,A(v) (2.3.156) 

Because ^ is a group, for any starting point g and desired point a, there exists an h which 
will take you from g to a. Specifically, h = ag~^, which is guaranteed to be in Q by the 
definition of a group, will accomplish this. 

Thinking of this in the "opposite direction", any invariant vector field v will define a 
unique vector at any arbitrary point g This vector at g can be found by simply taking 
the tangent map from any point a hack to g. Specifically, we will be looking at the point in 
Q which represents the identity element e (this point isn't automatically chosen for us - it 
depends on how we choose our coordinate functions and is ultimately arbitrary). So, any 
invariant vector field v on ^ will define a unique vector at e. 

Now consider the set of all invariant vector fields on Q. Each of these will define a 
unique vector at e G ^. Or on the other hand, every vector at e will define an invariant 
vector field on Q. We denote the set of all invariant vector fields 0. Because and the set of 
all vectors at e are one to one, we can think of as a vector space with the same dimension 
as TeQ, which is the same dimension as Q. Every vector in Tf,Q has a corresponding vector 
field in 0. 

In the previous section, we defined the Lie derivative C^(i)\^^'^ = [v'^*\ v'^-'^], which is 
the derivative of v^^^ in the direction of v*^*), and is itself a vector field on the manifold. 
Let's assume that v^*) and v*^-'^ are both in 0. Will [v*^*\ v'^-'^] be in 0? We can find out by 
acting on [v*^'\ v'^-'^] (g') with the tangent map Tgfh- 

V.([v«,vW](<7)) = [r,A(v«(^)), r,A(v(^)(^))] 

= [vW(/.^),v(^)(%)] 

= [v(*),v(^')](/i(7) (2.3.157) 

We used the general relationship Tgfh[^r^'\^r^^'^] = [TJ^i^^''^), r^A(v(j))] (which you can 
easily convince yourself of by writing out the definition of the commutator and the tan- 
gent map), and to get the second equality we used the fact that v^*) and v^^^ are each 
individually in 0. 



So, equation (2.3.1571 tells us that is closed under the Lie derivative, or commutation 
relations between the tangent vectors at each point. So, for any two elements of 0, we 
have the map 

[, ]: 000^0 (2.3.158) 

We call the set of all invariant vector field along with the vector multiplication [ , ] 
(which is just the commutator) the Lie Algebra of the Lie group /manifold Q. It may be 



helpful to reread section 1.1.5 to see that this does in fact fit the proper description of an 
algebra. 

For a given Lie group, we will denote its algebra by the same letters but in lower case. 
For example the algebra of SU{5) will be denoted su{5). Once again, we emphasize that 
every vector in Tg^ corresponds to a specific invariant vector field on Q. So it is natural 
to choose a set of vectors in TeQ which form a linearly independent basis set, or a frame. 
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for TeQ. Once this is done, we can not only span T^Q with them, but the corresponding 
invariant vector fields also form a basis for g. Also, by exponentiating the elements of the 
frame at e, we can move from e to any arbitrary point on Q. This is why we refer to the 
specific frame vectors at e as the generators of ^ - by starting at the identity they generate 
all the other points in the group of the manifold through exponentiation. 

We can follow the same arguments we made in ||40l or on the previous page to see 
that the the Lie derivative [v*^*\ v*^^^] at a point results in another vector at the point, and 
therefore if we consider the Lie derivative at e, then the commutator of two frame vectors 
must also be in TeG and therefore must be equal to some linear combination of the frame 
vectors: 

[v«,v(^)]=^.,v(^-) (2.3.159) 

We call the values fijk the structure constants of the algebra. In that they define the 
value of the Lie derivative at e, which via exponentiation contains the information about 
the geometry of the flow throughout Q, the structure constants completely determine the 
structure of the manifold Q. 

To once again connect this to what we did in ||40l , we had a parameter space of some 
dimension, and every point in the parameter space corresponded to a specific element of 
the group. For example with 5*0(3), the parameter space was three dimensional, corre- 
sponding to the three Euler angles: 0, ip, and 9. Then we had three "vectors" (we called 
them generators), J^, J^, and J^, with specific structure constants [J\ J^] = eijkJ^. The 
three dimensional parameter space was then spanned by the parameters with the gener- 
ators as basis vectors. So an arbitrary point in the parameter space was 

This point in the three dimensional parameter space corresponded to the group element 

The factor of i is simply from the fact that we were trying to make the generators Her- 
mitian. Recall that the generators were defined with an additional factor of i, so you can 
effectually ignore the i when comparing this expression to what we have done so far in 
this paper. 

Restating the previous paragraph, we have a three dimensional manifold, denoted 
Q = 50(3), with a group structure defined on it. We have chosen three vectors in the 
tangent space Te50(3) denoted J^, J^, and J^, each of which correspond to a specific 
invariant vector field in 0. So, to move from the point e G 50(3) to an arbitrary point on 
the manifold, we simply exponentiate the tangent vectors at e (which are part of a vector 
field covering the entire 50(3) manifold), and follow the curves defined by those vectors. 
This moves us along the manifold to any arbitrary point. The non-trivial geometry of 
the curves defined by the vector field which produced these generators is codified in the 
commutation relations, which are actually the Lie derivatives at each point - the way the 
field is changing. 

We chose three specific elements of to get these generators - we could have just as 
easily chosen any three invariant vector fields, resulting in an equivalent set of vectors 
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at e. In the language of BOll this would be doing a similarity transformation on the gen- 
erators J*. The resulting generators would still be generators, but would correspond to 
different elements of g. 

Everything we discussed in the previous sections about vector fields, flows and fami- 
lies of curves, the Abelian groups resulting from a single curve at a point, exponentiation. 
Lie derivatives, etc. will still hold true for Lie manifolds. We can still have vector fields, 
form fields, tensor fields, etc. The only difference between a normal manifold and a Lie 
group/ manifold is that the Lie manifold allows us to single out specific vector fields as in- 
variant, and the resulting details will be constrained by the invariance of the vector field 
under the action of the group. This will (as you can recall from [40J) result in tremen- 
dously convenient and powerful properties when we begin to once again discuss physics 
using Lie groups. 

2.3.12 Lie Group Manifolds Acting on Manifolds 

Finally, we want to extend these ideas from the geometry of a Lie manifold to the 
action of a Lie manifold Q on another manifold Ai. The action of ^ on is a map a : 
G ® M — > M. In other words, a specific element g E G takes a specific element of M to 
another point in A^. To see this more clearly, recall near the beginning of section 2.3.9 we 
defined the familiar of curves g(r, p) which took the point p E A4 to another point g(r, p) 
along the curve defined by q passing through the point p a "distance" r. 

What we are doing here is very similar. The difference is that, instead of an arbitrarily 
defined curve q, the curves in Ai are defined by the invariant vector fields of ^. As a 
simple example, consider M = M."^ and G = 5*0(2). For any point p = {x,yY E M. = M?' 
we can can choose a point g = 9 E G = 5*0(2) and act on p, moving it to another point in 
M^. For example if we choose (x, y)^ = (1, 0)^ E M^, and the element | E 5*0(2), the action 
of I on (1, 0)^ will be to take it to (0, 1)^ E M^. Or in the language of the above paragraph, 
the action of 50(2) on is the map a : 50(2) O — > Then, for example, 

a(7r/2,(l,0)^) =(0,1)^ (2.3.160) 

More generally, a has the following properties: 

1) a{e,p) =p. 

2) a{gi,(T{gj,p)) = a{gigj,p). 

If the Lie manifold G acts on in a well-defined way, we say that ^A "carries", or 
"sits in" the group G- 

Now we can go back to everything we have done in these notes so far that depended 
on a smooth map, the tangent mapping, the puUback, flows, etc., and do them all over 
again where the smooth maps are the actions of ^ on A^. 

As an example, consider once again the action of 5*0(2) on M^. We will work with 
the 2x2 matrix representation of 5*0(2). We want to use an invariant vector field on 
5*0(2), which is simply a constant tangent vector on the circle. So obviously the element 
of this vector field at the identity e of 5*0(2) will be this constant vector. 5*0(2) has general 
element 



cos 6 — sin 6 
sin 6 cos 6 



(2.3.161) 
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We know that the tangent vector to this at the identity e will be (cf [40]) 

= (l ~0^) ^ ^^"^^(2) (2.3.162) 
Exponentiation of this with the parameter 9 will then properly give 

e'^^=h'^, -''""f)^ (2.3.163) 
V sm 6 cos 6 / 



This is the exact situation we had in the example leading to (2.3.113 1. Working backwards, 
we can show that this action of SO (2) on has induced the flow of circles around the 
origin through some starting point. And this flow will induce the vector field 

w{x,y) = -y— + x— (2.3.164) 

We can do the same thing with the action of 5*0(3) on (see BOl for details), and find 
that there are three induced vector fields given by 

'{x,y,z) = -^^ + y^ 
d d 

v^^\x,y,z) = -x^ + z— 
ox ox 

w^'\x,y,z) = -y^+x^ (2.3.165) 

We leave these details to you. 

It is these vector fields induced on a manifold Ai by the action of a Lie group Q which 
will prove to be physically useful, as we will see later. The point of all of this is that the 
Lie group specifies certain vector fields on the manifold. 

2.3.13 Concluding Thoughts on Differential Topology 

We have defined differentiable manifolds and discussed the types of objects that can 
live on them, the types of mappings between them, and how we can add structure to 
them. However our discussion has left much to be wondered about. 

For example, while our complete dedication to a "coordinate free" approach has been 



(and will be) useful, there is much lacking. As we discussed briefly on page 41 there is 
no way in our current approach of telling the difference between a perfect sphere and 
an egg. In fact, we could stretch, twist, bend, and reshape S'^ all we want without any 
need to change the coordinate functions. In other words, we only have descriptions of 
manifolds "up to homeomorphism". 

What if we wanted to discuss the distance between two points on a manifold? There 
is no well defined way of answering that question in what we have done so far. Another 
ambiguity is angles. If we have two lines intersecting, how can we determine the angle 
between them? We could always stretch or twist the manifold however we want, thus 
changing the angle. 
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Another difficulty is the relationship between vectors and covectors. While we have 
discussed the differences between them, how are they related? Certainly there must be 
some relationship between the column vector (a, b, c)^ and the row vector (a, b, c). 

And finally, consider the manifold R.'^. At first glance this is simply four dimensional 
Euclidian space. But consider Minkowski space. This is also a flat four dimensional 
manifold. Topologically there is no real difference between and Minkowski space, 
yet we know from physics that they are extremely different. What is it that makes this 
difference? 

The answers to these questions will be dealt with when we introduce the idea of a 
metric on a manifold. However before we do this, there are a few more things we can say 
about manifolds defined only "up to homeomorphism". For example, while we can't tell 
the difference between a sphere and an egg, or between M'^ and Minkowski space (yet), 
we can certainly tell the difference between S'^ and the torus T^. They are topologically 
different, and we don't need geometry to see the differences - there is no way to stretch, 
twist, or bend one to look like the other. So we will take a short hiatus from our geomet- 
rical considerations to focus on a few key issues in topology. These topics will allow us to 
categorize spaces based on qualitative properties indicative of their topological structure, 
rather than on quantitative properties indicative of their geometric structure. 

2.4 References and Further Reading 

The primary source for this chapter was [[T7|, though we also made use of ||37ll . For 
further reading we recommend 0, H, IH, BH, BH, 13, or 13. 
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Chapter 3 

Algebraic Topology 



Before diving in, we briefly discuss the point of this chapter. As we said previously, topol- 
ogy is concerned with the qualitative aspects of spaces, rather than the quantitative. In 
other words, it isn't concerned with differences of length or angle, but of the general 
properties, or categories, of shapes. 

Another way of putting this is to ask the question "when I stretch, compress, twist, 
bend, and generally reshape a space, but I do so without breaking the space or adding 
points to it, what doesn't change?" For example, consider the annulus (a disk with a hole 
cut in it): 




One could categorize this as a two dimensional shape with a single hole. Furthermore, no 
matter how I stretch it, twist it, bend it, etc., it will still be a two dimensional shape with 
a single hole in it: 
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We say that "a single hole in in it" is a topological invariant of this two dimensional shape. 
We could also consider the two dimensional disk with two holes in it: 




Once again, we can stretch this all we want, but as long as we don't break it or add points 
to it, it will always be a two dimensional shape with two holes in it, and we can't make it 
look like the annulus by just reshaping it. 

So two dimensional holes are a topological invariant. Now, consider the sphere 
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Clearly this has a hole in it, but it isn't a "two dimensional" hole - the hole in this case is 
three dimensional. Again we can deform this: 




but it is always a space with a three dimensional hole. 
But then consider the torus: 




This also has a three dimensional space missing from its volume, but it is obviously a very 
different type of three dimensional "hole" than was in the sphere. The "double torus". 



93 



is also different from both the torus and the sphere. 

So, in that topology is concerned with categories of spaces and shapes, topology is usu- 
ally discussed in terms of "what doesn't change when we stretch, twist, etc. a shape?" 
This section is devoted to three of the most commonly discussed and powerful types of 
such topological invariants: homotopy, homology, and cohomology. The ultimate fulfill- 
ment of this chapter will be when we finally get to more advanced topics in string theory 
later in this series (though we will also use them quite a bit when doing QFT in the next 
paper in this series). The purpose of this chapter is not to give a comprehensive treatment, 
or even a particularly deep treatment. We merely intend to give the basic idea of each and 
move on. We will discuss all three in greater depth both in this paper and later in this se- 
ries. Many of the more interesting features of all three live in the merging of topology and 
geometry, especially in the theory of fibre bundles, which we will discuss later in these 
notes and in later papers. For now we are merely interested in planting seeds. It will take 
a while for them to grow into something useful, but we will see that topology provides 
some of the most powerful tools for advanced physics. 

When we do discuss these ideas again later in this paper, we will consider a few of the 
non-string theoretic applications of topology in physics. 

The general area of topology these topics are usually studied in is called "algebraic 
topology". It is an approach to topology in which we can take questions which are inher- 
ently topological but extremely difficult for topology to answer, and rephrase them in the 
language of algebra (usually making them easier to answer). Once again, we will discuss 
this later in this paper and subsequent papers. We mention this now to give the larger 
context of what we are doing. 

We will be talking about "spaces" X throughout this chapter. We are doing this to echo 
the mathematics literature. We will eventually be interested in specific types of "spaces". 
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namely manifolds. For now bear with us as we talk about spaces, keeping in mind that 
we will eventually be talking about manifolds. 

As one final comment, we admit up front that this section will be lacking a tremen- 
dous amount of rigor. A glaring example is that we aren't going to go through the usual 
introduction to topology. We won't discuss things like topologies on sets, covers, com- 
pactness, and so on. While we are losing quite a bit by skipping this material, but we are 
gaining brevity. As we said before, the purpose of this series is to provide the "forest" 
rather than the "trees". We will depend a great deal on the intuitive ideas of "space" and 
other related concepts. We will provide greater depth when absolutely necessary. We do 
provide references for suitable introductory topology and algebraic topology texts, and 
we encourage the reader to study those (or take a course) after or during reading these 
notes. 

With all of that said, we can begin. 
3.1 Homotopy 

3.1.1 Qualitative Explanation of Homotopy 

Consider the spaces M and M^. We can plainly see that these spaces are not homeomor- 
phicnto each other. But how could one prove this mathematically? 



One way would be to consider removing a single point from M. If this is done, the real 
line is broken into two disjoint pieces. For example if we remove the point 0, the space 
M — {0} is broken into two spaces, and if you are on one side, say at —1, it is impossible 
to move to the other side, say +1, without leaving the space. 

But with the plane M?, the removal of a point doesn't have the same affect. Removing 
a single point leaves the plane very much intact. This difference between M and M? is 
a fundamental topological distinction, and it illustrates one very important topological 
idea - connectedness. There are several different types of connectedness, and we will 
discuss several of them in this section. For our purposes now, however, the intuitive idea 
of what you'd expect "connected" to mean will work. When M is split into two parts, it 
obviously isn't a connected space - there are two rfzsconnected parts. minus a point, on 
the other hand, is still one single space. This idea of connectedness is our first example of 
a topological invariant for this chapter. Of course, saying that space Xi and space X2 are 
both connected doesn't mean that they are homeomorphic, but if one is connected and 
the other isn't we can obviously say that they aren't homeomorphic. 

Now consider the case of and M^. Obviously the removal of a single point from 
either leaves each of them connected, and therefore our method above of removing a 
single point doesn't help us. However, consider removing a point from each and then 
drawing a loop around the missing point in each. Then ask the question "can this loop be 
smoothly retracted to a point (other than the missing point)?" With the answer is "no", 
because this would require the loop to pass through the missing point, which it cannot 



^Recall that a homeomorphism is a mapping where a space merely changes shape without changing 
topology. In this chapter you can read "homeomorphic" as "having the same topological structure" 



do: 
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On the other hand, with this isn't a problem - we can simply use the extra dimen- 
sion to move the loop around the missing point and then continue bringing it to a single 
point. So this "loop" approach has given us a way of distinguishing and M^j^ 

If we move on to compare M.^ to M'^, drawing loops around a missing point once again 
fails us, because in either space we have the "room" to simply slide the loop around the 
missing point. But consider a circle S"^ being drawn around the missing point. In there 
is an extra dimension to slide the circle around the missing point, but in M'^ the circle is 
"stuck" around the point. 

We can generalize this to say that to compare M" to ]R"+^, we must remove a point from 
each and then use ^"^^'s around the missing point to see if they can be retracted down 
to a single point (other than the missing point). In each case, the S'"^ can be retracted in 
M"+^ but not in M". 

As an example that doesn't use Euclidian space, consider the circle S^. If we choose 
a point on the circle and draw a loop around the circle (coming back to the same point), 
then we obviously can't retract this circle to a point without leaving the circle. However, 
any loop on S"^ can be retracted in this way. The torus is another example of space 
where some loops can be retracted and some cannot: 

^Of course there are many other ways - we are taking a route that serves our purposes for this section. 
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The torus, however, provides two options for loops that cannot be retracted. 

All of these ideas are made precise with the introduction of homotopy groups. For a 
given space X, we can define an infinite number of groups, which we denote vr„(X) for 
n = 1,2,..., where each 7r„(X) is called the n*^ homotopy group of X. The meaning of the 
n*'^ homotopy group is that it keeps track of the number of distinct ways an n-sphere (S*") 
can be mapped into X. For example with — {0} there were two different ways to map 

into it - where it didn't go around the missing point and could therefore be retracted to 
a point, and where it did go around the missing point and therefore couldn't be retracted 
to a point. 

3.1.2 Path Homotopy and the Fundamental Group 

We will restrict our discussion in this chapter to the first homotopy group 7ri(X) for 
two reasons: 1) it has a unique importance in physics, and 2) higher homotopy groups 
are extremely difficult to calculate and we don't have the machinery yet. With that said, 
even to study 7ri(X) we need a few definitions first. 

The first definition is a specific idea of connectedness. We say a space is path con- 
nected if for any two points Xj, Xj G X there is a smooth path that can be drawn connect- 
ing them. This lines up with the intuitive idea of connectedness, and the only examples 
of spaces that can be considered "connected" but not path connected are mathematically 
interesting but not relevant for our purposes. Any space we consider that is not made up 
of disjoint parts is path connected. 

Next we define a specific type of pathj^ A path from xq to in a path-connected 

^It may seem strange that we defined path connectedness in terms of paths, and are now defining paths. 
The notion of path above in the definition of path connectedness was intended to be more intuitive - it 
simply meant that if you are standing on xi then you can get to Xj without leaving the space. Here we are 
giving a more formal definition of a different (but admittedly related) concept. 
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topological space X (where xq, e X) is a continuous map a such that 

a : [0, 1] — >X (3.1.1) 

such that 

a(0) = Xq 

= xi (3.1.2) 

The points xq is called the initial point, and xi is called the terminal point. 

In the previous section we talked loosely about "retracting" a loop to a point. We want 
to make this more clear. Consider two different paths from xq to xi, which we call a and 
(5. Obviously we will have q;(0) = /9(0) = xq and a{l) = (3{1) = xi, but a and f3 can differ 
in between. We say that a and f3 are path homotopic relative to the set {0, 1} if there exists 
a continuous map F such that 

F : [0, 1] X [0, 1] — >X (3.1.3) 

which satisfies 

F{0,t) = xo and F{l,t) = xi e [0, 1] 

F{s, 0) = a{s) and F{s, 1) = P{s) Ws E [0, 1] (3.1.4) 

To understand this more clearly, consider the two paths: 




The idea is that the first variable in F{s, t) represents a point along the path. The second 
variable represents a parameterization of an infinite number of paths "in between" a and 
p. 
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^1 



For example the path at i = 1/2 would be somewhere in between a and (5 




There are an infinite number of paths from xq\.o xi, one for each real number t e [0, 1]. 
We call the map F(s, t) a path homotopy. If such a map exists so that a and (5 axe path 
homotopic relative to {0, 1}, we denote it 

a ~ /3 : {0, 1} (3.1.5) 

The meaning of this is that the tilde ~ indicates that a and (3 are homotopic, and what is 
to the right of the colon is where they are identical - in this case they are only the same at 
and 1. 

As a simple example, if X = M", where xq and xi are any two points in M", define 

F(s,t) as 

F{s,t) = {l-t)a{s) + tp{s) (3.1.6) 
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This obviously satisfies (3.1.41 as long as a and /3 each satisfy (3.1.21. 

On the other hand consider once again — {0}. Let xq be on the positive x axis and 
xi be on the negative x axis. Then let a be a path from xq to xi that goes above the missing 
point at the origin, and then let /3 be a path that goes under the missing point. 




Clearly no such F{s, t) can be written because the homotopy would, at some point, have 
to pass over the missing point. Therefore F wouldn't be continuous, and the definition 
does not hold - in this case a and {3 are not path homotopic. 

And in general, whether two paths from xq to Xi are path homotopic is a question 
about the topology of the space X. 

The foundational idea of homotopy is that, for any points xq, xi G X, the set of all 
paths from xq to xi which are path homotopic to each other forms an equivalence rela- 
tionlf] 

To be more precise, if X is a topological space and a, (3, and 7 are paths from xq to xi, 
then we have 

1) a~a : {0,1} 

2) a ~ /3 : {0, 1} /3 ~ a : {0, 1} 

3) a ~ : {0, 1} anti /? ~ 7 : {0, 1} ^ a ~ 7 : {0, 1} 

''if you aren't familiar with equivalence relations or equivalence classes, you are encouraged to review 
them in one of the items in the references and further readings section. The basic idea is to define a set 
of items which can be related to each other in some way. In this case we say that they are "equivalent", 
and all of the items which are equivalent form an "equivalence class". For example consider the disk - we 
could say that two points on the disk are "equivalent" if they are at the same radius. Then the equivalence 
class would be the collection of all points at a certain radius. This would result in an infinite number of 
equivalence classes, all parameterized by the radius r. On the other hand, we could impose the equivalence 
relation that all points on the same line from the center of the disk are equivalent. Then there would be an 
infinite number of equivalence classes parameterized by the polar angle. The standard notation for two 
things being "equivalent" under some equivalence relation (which has been defined in the text) is that a 
and h are equivalent is a ^ &. This footnote should suffice for what you need to know about equivalence 
classes for these notes. 
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All three of these should be intuitively clear, and we therefore omit a rigorous proof. We 
will denote the set of all paths that are path homotopic (equivalent) to a as [a] . 

As an example, consider again — {0} with xq and xi on the positive and negative x 
axis, respectively. It is clear that for any two paths that both go above the missing point 
at the origin, a path homotopy F{s, t) can be written. Therefore we can say that all paths 
which go above the origin are equivalent. Similarly all paths that go below the origin are 
path homotopic and therefore equivalent. Therefore we have two equivalence classes, or 
"types of paths", from xq to xi. We could choose any (single) arbitrary path a above the 
origin and any (single) arbitrary path (3 below it, and then the set of all paths above the 
origin would be denoted [a], and the set of all paths below the origin would be [/?]. This 
is not to say that there exist only two elements. One could also have a path which looped 
around the origin: 




This would define yet another equivalence class. Notice that we could define clockwise 
as positive and counterclockwise as negative, and we could have paths which looped 
around the origin any integer number of times. We will explore this relationship between 
loops and integers more soon. 

Now define the backwards path of a, denoted a^, as 

= a{l - s) (3.1.7) 

The backwards path of a path a is again a map : [0, 1] — > X, but it goes in the opposite 
direction as a.. So if o; is a path from xq to xi, then is a path from xi to xq. We can see 
(though we do not prove because it is a fairly intuitive result) that if two paths a and (3 
are path homotopic, a ~ /3 : {0, 1}, then ~ (3^ : {0, 1}. 

We consider three points in X, xq, xi and X2. Let a be a path from xq to xi, and let /3 be 
a path from xi to X2. Define the composite map af3 : [0, 1] — > X as 

a(3{s) = a{2s) , < s < 1/2 

(3(2s-l), l/2<s<l (3.1.8) 
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It should also be clear (though again we do not prove) that if cti and a2 are both paths 
from xq to xi, and /9i and ^2 are both paths from xi to X2, then 

ai ~ q;2 : {0, 1} and P2 ^ P2 : {0, 1} <^ aiPi ~ a2P2 : {0, 1} (3.1.9) 

We can define equivalence classes of backwards paths as well. Naturally we will have 
the relation 

[a]^ = [a*-] (3.1.10) 

This definition does not depend at all on which element of [a] we choose. 

It is also easy to see that we can define the product of equivalence classes as 

[a][(3] = [a(3] (3.1.11) 

as long as /3 begins where a ends. 

Now we get to the crux of homotopy. If a path a satisfies 

a(0) = (3.1.12) 

then we say that a is a loop at a(0) = a{l) = xq. Notice now that if a and f3 are both loops 
at Xq, there will never be a problem with defining a^. 
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One loop we will work with a great deal is the trivial loop, denoted c, which is defined 

as 



c(s) = xo Vs G [0, 1] 



(3.1.13) 



It is a "loop" that never leaves the point xq. We will denote the set of all loops equivalent 

to c as [c]. 

More generally, the set of all homotopy classes of loops at xq is denoted 7ri(X, xq). The 
most important result of this section is that this set has a group structure. Once we rec- 
ognize this structure, we call 7ri(X, a;o) the Fundamental Group, or the first homotopy 
group, of X, with base point xq. We define the group structure as follows: 

1) A given element of the group is [a], where a is a loop at some point Xq. It turns 
out that the point xq chosen doesn't matter as long as the space is path connected. How- 
ever, it is important to keep track of where the base point xq is. While the choice of base 
point doesn't affect the fundamental group of the space, loops from two different base 
points are always two different points. In other words, if a loop can only be continuously 
deformed to another by moving its base point, those two loops are not homotopic. How- 
ever, because the fundamental group doesn't depend on the choice of base point, we will 
omit it from our notation for most of this section. 

2) Group multiplication between two elements [a] and [/3] is [a][/3] = [a(3]. 

3) The identity is [c] . 

4) The inverse of an element is [a]~^ = [a^]. 

For example, consider once again the space X = M^. Choose an arbitrary point, say 
the origin, and consider loops from the origin. Clearly, because there are no holes in M^, 
any loop will be path homotopic with the trivial loop. 



Therefore every loop in 7ri(]R^) will be in [c]. Thus vri(]R^) only has the single element ([c]), 
and is equal to the trivial group with one element (cf 14011 if this is not familiar). 

Now consider — {0}. Choose an arbitrary point, say at (x, y) = (1,0) to be the point. 
Clearly there will an infinite number of loops that are path homotopic to c. We could also 




103 



make a loop which goes around the missing point a single time counterclockwise. We 
denote an arbitrary loop of this type a+i, and the equivalence class of all such loops [a+i]. 
We could also go around counterclockwise twice - denote this equivalence class \c(+2\- 
Generalizing, we could go around counterclockwise any number n times - we denote this 
equivalence class It should be clear that [q;+„] and [a+^j are not the same for n^m: 
it is not possible to continuously map a loop that goes around n times to a loop that goes 
around m times. 

We could also have a loop that goes around clockwise some number n times. We denote 
this equivalence class [ct-n]- So the set of all equivalence classes will be 

TTi (M^ _ {0}) = {ao, a±i, q;±2, • • • } (3.1.14) 

Furthermore, if we have an arbitrary element of, say [ai] and an arbitrary element of, say, 
[q;_i], then their composition will be 
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which is expressed as 

[q;i][q;_i] = [aitt-i] = [ai-i] = [ao] (3.1.15) 

Or more generally, 

[anlfttm] = [ttn+m] (3.1.16) 

So finally we can see that the fundamental group of — {0} is the group Z (with 
addition). 

As one final definition for this section, we say that a space X is simply connected if 
7ri(X) is equal to the trivial group with one element. With a little thought you can see 
that therefore is simply connected for all n, and that — {0} is simply connected 
for n > 2. The circle is not simply connected, but 5" is for n > 1. The sphere S"^ is 
therefore simply connected, but so is 5*^ — {p} where p is an arbitrary point on the sphere 
(take a moment to convince yourself that S"^ — {p} is homeomorphic to a disk). However 
S'^ — {p} — {q} (the sphere with two points missing, which is homeomorphic to a cylinder) 
also has fundamental group equal to Z (convince yourself of this) and is therefore not 
simply connected. Another way of saying this is that "not all loops are contractible". 

3.1.3 More Examples of Fundamental Groups 

Because detailed calculations of homotopy groups (even fundamental groups) can be 
extremely difficult, and we don't have the mathematical "machinery" to do so, we will 
take a section to merely quote the fundamental groups of various spaces. These exam- 
ples should provide a good intuitive understanding of how homotopy groups, especially 
fundamental groups, behave. 

As our first (uninteresting) example, we mention once again that the fundamental 
group of the circle 5^ is 

t:i{S^) = Z (3.1.17) 

As another example, consider two spaces Xi and X2. We can form the product space 
X = Xi® X2. What will the relationship between 7ri(Xi), 7ri(X2), and 7ri(X) be? To see 
this, consider maps pi which "project" from the product space X onto the Xf 

Pi : Xi ® X2 — y Xi 

Pi{xi,X2) = Xi for i = l,2 (3.1.18) 

Any loop ma e X can then be projected to either X^: 

Pi{a{s)) = ai{s) e Xi (3.1.19) 

On the other hand, any two loops ctj e X^ give a well defined loop in X. So define a map 

0:7ri(X) 7ri(Xi)©7ri(X2) 

= ([«i],[«2]) (3.1.20) 
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clearly preserves group structure: 

0([«/?])= ([«l/?l],[«2/52]) (3.1.21) 

and can be inverted: 

= H (3.1.22) 

and therefore is an isomorphism]^ between the fundamental group of X and the direct 
sum of fundamental groups of Xi and X2. In other words, we have the extremely useful 
result 

7ri(X) = 7ri(Xi ® X2) = 7ri(Xi) © Trails) (3.1.23) 

As a simple illustration of this consider the infinite cylinder. You can likely guess 
(with a little thought) that the cylinder will have the same fundamental group as 5^. But 
we can prove this using ( 3.1.23[ > as follows. The cylinder can be written as M (S> S*^. We 



know 7ri(M) = 1 and 7ri(S'^) = Z. Therefore the fundamental group of the cylinder is 
7ri(M (g) S^) = 7ri(]R) © 7Ti{S^) = 1 © Z = Z, which is what we expected. 

A less obvious example is the torus T^. This can be written as = S^. And 
we know that 7ri(S'^) = Z, and therefore vri(r^) = Z © Z. So an arbitrary element of the 
fundamental group of will be [ctm])/ where is a loop around one {n times), 
and am is a loop around the other (m times). 

Another example is the n-dimensional sphere 5*". The fundamental group is 



7ri(^") = 1 (3.1.24) 



(the identity) for n > 1. 
For the Mobius strip Ai, 



TiiiM) = Z (3.1.25) 
As one final extremely important example, consider the "figure-eight" 




^We will explain what an isomorphism is in the next section, but the basic idea is that it is a way of 
saying that two groups are the same. 
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This has a non-Abelian fundamental group group. Namely, given two elements gi and 
an element of the fundamental group of the figure-eight is the set all all elements made 
from products of these elements. For example the identity would be gi = 92 = 1/ which 
would be the identity. Another arbitrary element would be gfglgi^- Yet another would 
be g\g2^. The product of these two would be 

glglgi'gbf = gfgl (3.1.26) 

which is also an element. Obviously the order of this group is infinite, and it is non- 
Abelian: 

gm 7^ g2gi (3.1.27) 

You can see the non-Abelian nature of this group as follows: consider a loop a around 
the left side of the figure-eight, and another loop /3 around the right side. Consider the 
composition a/3: 




This is the path where you go around the left side first and then the right side. The 
opposite path, f3a, consists of going around the right side first and then the left side. 
If you were to try to continuously deform a/3 into /3a, you would have to move the base 

^The elements gi and 52 correspond to wrapping around the two sides of the figure-eight some number 
of times - for example gfgl would be wrapping around one side four times and the other side seven times. 
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point of both loops in order to do so. As we mentioned in the previous section, if you must 
move the base point of a loop to move it to another loop, those loops are not homotopic. 
Therefore a l3 is not homotopic to Pa. 

3.1.4 Homotopy and Homomorphisms 

The point of homotopy is the intuitively clear fact that the fundamental group is a 
topological invariant of X. If X has fundamental group tti (X), then now matter how you 
stretch, twist, etc. X it will have the same fundamental group. 

This fact can be made more formal by considering maps from one space to another. 
But first we need another definition. A homomorphism is a map from one group to 
another that preserves the group structure. For example if /i is a homomorphism from 
group Gi (with group multiplication -ki) to group G2 (with group multiplication -k2), then 
for Qi, Qj e Gi, it must be true that h{gi ^1 gj) = h{gi) ^2 h{gj). This simply means that if 
you map every element of the group through h, you still have the same group structure. 

But notice that if G2 is the trivial group, so h{gi) = 1 Vi, we still have a homomorphism. 
In other words, h doesn't have to be invertible to be a homomorphism. This leads to the 
stricter idea of an isomorpism. An isomorphism is an invertible homomorphism from one 
group to another. This type of map preserves the entire group structure - nothing is lost. 
So a map which takes every element of Gi to the identity is a homomorphism but not an 
isomorphism. You can generally think of an isomorphism as being a group theoretic way 
of saying "equal". If Gi and G2 are isomorphic, they describe the same general structure. 
Of course, what they act on may be different - the j = 1/2 and the j = I representations of 
SU{2) are isomorphic, but they act on very different things. The groups S0{2) and U{1) 
are isomorphic, but they act on different things. 

So, to understand homotopy more clearly, we introduce a "homomorphism induced 
by a continuous map". Suppose now that / : Xi — > X2 is a continuous map that carries 
xi e Xi to X2 ^ X2. We can introduce the notation 

f:(X,,xi)^{X2,X2) (3.1.28) 

where the first item in parentheses represents the spaces / maps from and to, and the 
second item represents specific points / maps to. 

If a{s) is a loop in Xi with base point xi, then the map f{a{s)) is a loop in X2 with 
base point f{xi) = X2- Therefore the map / defines a homomorphism, which we denote 

from 7ri(Xi) — ^ vri(X2). 

This leads to the natural result (and the point of this section) that if / : (Xi, xi) — > 
(X2, a;2)is a homeomorphism, then is an isomorphism between 7ri(Xi) and 7ri(X2). In 
other words, if Xi and X2 are homeomorphic, then they have the same fundamental 
group. 

Of course the converse is not necessarily true - two spaces having the same funda- 
mental group does not necessarily mean that they are isomorphic. For example Si and 

- {0} both have t^i{S^) = 7ri{R'^ - {0}) = Z, but they are not homeomorphic. The 
invariance of the fundamental group allows us to say that, if two spaces do not have the 
same fundamental group, they are not homeomorphic. 
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3.1.5 Homotopic Maps 



As one final idea regarding homotopy, we can not only apply homotopy to spaces, but 
also to maps between spaces. Recall in the previous section that we mentioned that the 
the fundamental group is invariant under homeomorphisms, but two spaces having the 
same fundamental group doesn't guarantee that they are homeomorphic. This section 
will illustrate why. 

Consider two different maps: 

f,g:X,^X2 (3.1.29) 
We say that these two maps are homotopic maps if there exists a continuous map 

F:Xi^ [0, 1] — > X2 (3.1.30) 

such that 

F{x,Q) = f{x) and F{x,l) = g{x) (3.1.31) 

For example if X1 — X2 — R, and f{x) — sin(x) and g{x) — x^, then we could define 

F{x, t) = {1- t) sin(x) + tx'^ (3.1.32) 

which is well defined at every value of x and t. Therefore sin(a;) and are homotopic 
maps from M to M. 

now consider again two spaces Xi and X2. We say that Xi and X2 are of the same 
homotopy type, which we denote Xi ~ X2, if there exist continuous maps 

g:X2 — >Xi (3.1.33) 

such that f{g{x2)) and g{f{xi)) are each homotopic maps with the identity. 

For example consider R and the point p (we are treating p as a space by itself, not a 
point in M). We can define / : M — > p hy f{x) = p \/x e M, and then g : p — > M 
by = e M. So, the map f{g{p)) takes p to itself, and is trivially homotopic to the 
identity. The map g{f{x)) will take every point x e R to e R. The identity map on R is 
id]^{x) — X, so we can define 

F{x,t) = {l-t)x + tO = {l-t)x (3.1.34) 

which is continuous and therefore p and M are of the same homotopy type. 

Now consider M and S^. We can define / : M — > as f{x) = {x mod 2tt) G S^, 
and g : — > R as g{9) = 9 eR. Then f{g{0)) = 6, which is obviously homotopic to the 
identity. But g{f{x)) — x mod 27r is not homotopic to the identity. Therefore and R are 
not of the same homotopy type. 

The amazing (and extremely useful) point of all of this is that if two spaces are of the 
same homotopy type, then they have the same fundamental group even if they are not 
homeomorphic. 
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Clearly homotopy type forms a less strict classification of spaces than homeomor- 
phism. There is much, much more we could say about homotopy, but we will save those 
ideas for later. We trust that this section has provided a basic understanding of what ho- 
motopy is, and how it can be used to classify spaces. The two main ideas to take away 
from this section are that 1) the fundamental group tti{X) is invariant under homeomor- 
phism, and 2) spaces of the same homotopy type form an equivalence relation among 
spaces - if two spaces are of the same homotopy type, they have the same fundamental 
group. 

3.2 Homology 

3.2.1 Qualitative Explanation of Homology 

Consider the two-dimensional disk and S^: 




Obviously they are topologically different, but how can we characterize this difference? 
Homotopy is one obvious option (since ni{D'^) = 1 and iiiiS^) = Z). But as we saw in 
the previous section, spaces can be homotopic but not homeomorphic. Therefore, if we 
want to classify spaces as much as possible, having more topological invariants would 
be helpful. Also, as we mentioned above, homotopy groups can be extremely difficult to 
calculate. Something easier would also be nice. 

With that in mind, one obvious thing we can notice is that and are the same, 
except contains its interior, whereas does not - is the boundary of D"^. We can 
also think of this in the following way: is equivalent to with its interior filled in. 

is without the interior filled in. So D'^ has an that is the boundary of something, 
whereas has an that is not the boundary of something. And the fact that in S^, the 

is not a boundary is what implies that there is a hole in 

''This may seem strange or even silly, but bear with us. 
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On the other hand consider the annulus: 




We can once again think of the outer boundary as an that is the boundary of something. 
However we can also think of the inner boundary as an S-^, but this is not a boundary 
(there is nothing inside it). Once again this implies a hole. 

More generally, we can take a loop (something homeomorphic to S^) in a space such 
that the loop is not a boundary of something to be an indicator of a hole in the space (if 
this is not obvious, think about it for a second). 

Generalizing once again, we can consider the three dimensional ball and the sphere 
5"^. Here, S'^ is the boundary of B^, but does not have a boundary itself. So, the 
presence of an that is not the boundary of anything indicates a hole, but a different 
type of hole than the presence of a loop that is not the boundary of anything. 

This is the basic idea of homology - an object that does not itself have a boundary is an 
object that can be the boundary of something else. Therefore an object that does not have 
a boundary, and is not the boundary of anything, will indicate a certain type of hole (a 
two dimensional hole as in or the annulus, or a three dimensional hole as in S'^, etc.). 

Intuitively it is likely clear that there must be some similarity between homology and 
homotopy somewhere under the surface. Indeed there is, and we will discuss this simi- 
larity later. 

Once again, we are doing algebraic topology, and therefore our goal will be to assign 
a group structure to the topological ideas we are dealing with. We will therefore need a 
few additional algebraic ideas, but we will introduce them as we go instead of all at once 
up front. We feel this will ease learning in this particular section. 

3.2.2 Simplicial Complexes 

We begin by considering a nice geometric way of representing almost any space. A 
simplex is a building block for larger spaces. We often preface the word "simplex" with 
the dimensionality of the simplex. So an n-dimensional simplex is an "n-simplex". 

To get a feel for the structure of an n-simplex, we give an oversimplified definition 
that let's us get the basic idea. We will define a n-simplex in ]R"+^ as the set of all points 
in ]R"+^ with non-negative coordinates which all add up to 1. 

For example, the 0-simplex will be the set of all points in ]R°+^ = M with positive 
coordinates adding up to 1. This will obviously be the single point at 1 e R. So, the 
0-simplex is a single point. 
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The 1-simplex in will be the set of all points (x, y) e R such that x + y = 1 (with 
x,y > 0). In other words, this will be the points on the (all positive part of the) line 

y = 1 — X, or. 




(1,0) 



So the 1-simplex is a solid line. 

The 2-simplex in will be the set of all points {x, y, z) E such that x + y + z — 1 
(with x,y,z > 0), or (the positive part of) the plane defined hy z = 1 — x — y, or. 




So the 2-simplex is a solid triangle. 

Continuing, the 3-simplex in will be a solid tetrahedron: 
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These examples should give you a good idea of the basic structure of simplexes. As 
a comment, we are not really interested in the specific definitions given above, settled so 
neatly in R". We are interested in the general structure of simplexes (i.e. a 0-simplex is a 
point, a 1-simplex is a line, a 2-simplex is a triangle, etc.), which this definition illustrates. 

Notice that in each case, the n-simplex is a collection of (n — l)-simplexes with the 
space between them filled in. For example the 1-simplex is two points (0-simplexes) with 
the one-dimensional space between them filled in: 



The 2-simplex is three 1-simplexes (lines) with the two-dimensional space between them 
filled in: 
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The 3-simplex is four 2-simplexes (triangles) with the three-dimensional space between 
them filled in, etc. So in general, an n-simplex will be a collection (n + 1) (n — l)-simplexes 
arranged together with the n-dimensional space between them filled in. Therefore, it is 
clear that an n-simplex will always be an n-dimensional space. And, from this observa- 
tion, notice that each n-simplex is an n-dimensional space with an (n — 1) dimensional 
boundary. 

We can generalize this overly simplified definition by considering, instead of nicely 
arranged points on the axes of M"^"*"^, n + 1 arbitrary geometrically independenl]^ points 
in for any m > n, which we label Pq, pi, p2, ■ ■ ., Pn (each with their own set of m- 
tuple coordinates denoted x^^ for i = 0, ■ ■ ■ n). Then, we define the simplex over these 
n+1 points similarly to the "overly simplified" definition above. Specifically, for the n + 1 
geometrically independent points po,Pi, • • • ,Pn, the n-simplex over these points, denoted 

(Tn, is 

n n 

X = ^ CjXj,,, Ci> 0, Ci = 1 > (3.2.1) 

i=0 i=0 ' 

We also denote such an n-simplex over these n + 1 points by 

CTn = {PoPl ■ ■ ■ Pn) (3.2.2) 

So, for example, consider two arbitrary points in (we could also use M} if we 
wanted), denoted po and pi. 

^The term "geometrically independent" means that no n — 1 hjrperplane contains all n + 1 points. For 
example a 2-simplex in will contain three points. We demand that no single 2—1 = 1 dimensional 
hyperplane, or line, contain all 2 + 1 = 3 points. 



X G 
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This will create the 1-simplex (poPi) shown below: 
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These will create the 2-simplex (P0P1P2) shown below: 




Next we define the notion of a simplicial complex. A simplicial complex is a set 
of complexes fitted together in such a way as to form a larger space. For consistency we 
demand that, for any n-simplex that is in the complex, all the boundaries of that n-simplex 
are in the complex. This means that you can't have, say, a 2-complex without a boundary: 
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/ 




is not permitted. Nor can you have a 2-simplex without the 0-simplexes that makes the 
vertices: 



is not permitted. However, 
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is permitted. 

Another possible "poorly" constructed complex would be something like 




To prevent things like this, we say that if ai and (T2 are both in the complex, then either 
(Ti n (72 = 0, or (Ti n (72 is a face of (or equal to) ai, or ai fi (T2 is a face of (or equal to) (T2. 
Obviously the complex in the above diagram doesn't fit this requirement. 
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It turns out that, for most spaces X (especially the spaces we will be working with), 
there exists a simplicial complex Kx that is exactly homeomorphic to X. Such spaces 
are called triangulable, and the pair (X, Kx) is call the triangulation of X. Of course 
the triangulation of a space is not unique - for example we could denote in any of the 
following ways: 



Or the sphere in any of the following ways: 
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We also introduce the convention that the simplexes (poPi ■ ■ - Pn) are directed according 
to the order of the vertices. In other words the 1-simplex (poPi) is directed as going /rom 
Po topi. 




It will be the "opposite" of the 1-simplex (pipo) 
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We therefore write 

(PoPi) = -(piPo) 
The 2-simplex (P0P1P2) is also directed as 
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p 

1 



So, here we find 



(P0P1P2) = (P1P2P0) 



-{P1P0P2) 



{P0P2P1) 



(3.2.4) 



This generalizes in the obvious way. 

3.2.3 The Group Structure of Simplexes 

The construction of simplicial complexes in the last section allows us to define a group 
structure. For a complex K, we define the n-Chain Group, which we denote Cn{K), as 
the additive group over Z with elements 



where Ci e Z and an,i is the i*^ n-simplex in K. A single element c e Cn{K) is called an 
n-chain, and addition of n-chains c—^^ Qan^i and c' = ^ ■ c-cT„,i is 




(3.2.5) 




(3.2.6) 



For example we can consider the complex 
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There are five zero simplexes (the five vertices), so we will have cro,i for i = 1, . . . , 5, seven 
1-simplexes, so we have cri^j with i = 1, . . . , 7, and two 2-simplexes, so we have a2,i with 
i = 1,2. So, the elements oi Co{K) will be of the form 



C G Co{K) = Ci(To,l + C2(To,2 + £3^0,3 + C^aQ^^ + C5Cro,5 = CiCTo^i (3.2.7) 



i=l 



So, consider the two elements 



C = 2cro,l - 12(To,4 + ctq.s 

C = -0-0,2 + 4(70,4 - 0-0,5 (3.2.8) 



These will give 



c + c' = 2cTo,i - (ro,2 - 8cTo,4 G Co(ir) (3.2.9) 
Don't loose sleep trying to picture what a given element "looks like" geometrically 



on the graph. For example the first term (c) in (3.2.81 doesn't correspond to two copies 
of cro,i, negative twelve copies of cro,4 and one copy of 0-9,5 or anything like that. We are 
effectually created a vector space with vectors over the field of integers. Each n- 
simplex is analogous to a "unit vector" in the cr„,i direction. There is no deeper geometric 
meaning you should concern yourself with. 
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Our group structure is as follows: 

1) For any c, c' G CJK), we will have (c + c') e Cn{K). 

2) Associativity trivially holds. 

3) The element c = will be the identity element of Cn{K) Vn. 

4) For any element c e Cn{K), the element — c will be the inverse of c. So c — c = 0. 
Notice that because the group operation is additive, this will always be an Abelian 

group. 

3.2.4 The Boundary Operator 

While we do have a nice group structure, we admit this group isn't particularly in- 
teresting. In order to proceed we need the boundary operator. For a directed n-simplex 
Cn = {po: Pi • ■ • Pn), the boundary operator dn, which results in the boundary of cr„, acts as 
follow: 

n 

dnCTn = ^{-'^YiPoPl ' ' ' Pi ' ' ' Pn) (3.2.10) 

i=0 

where the hat symbol " indicates that the element under it should be omitted. 
For example consider the 0-simplex (po) • This is obviously 

doao = (3.2.11) 

In other words, uq has no boundary. This is intuitively clear - a single 0-dimensional point 
has no boundary. 

Next consider ui = (poPi)- This will give 

aiai = (pi) - (po) (3.2.12) 

Looking at cxi again: 
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it is the directed-ness that results in the minus sign. This is simply the "jfinal minus ini- 
tial" points of the directed line element. The point is that the boundary of ai is a linear 
combination of the points which make the boundary of the line - its endpoints. 

For (72 = (P0P1P2)/ we have 

92(72 = (P1P2) - (P0P2) + (PoPi) 

= (P1P2) + (P2P0) + (poPi) (3.2.13) 

Once again this makes sense - the boundary of 
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One consequence of the definition of the boundary operator is the following: 

dn{dn+lCrn+l) = <9n f ^ (" 1)* (Po " " " Pi " " " Pn+l) j 



i=0 
n+1 i-1 

i=0 j=0 
n+1 n+1 

+ E E (-i)^(-ir^po ■■■p^---Pr ■■vn+i) 

i=0 j=i+l 



'.i 

^(-1)'+-' {P9---Pi---Pj--- Pn+l) 



j<i 



j>i 

= (3.2.14) 



In other words d is nilpotent - = 0. Briefly look back at equation (2.3.731, where we 
pointed out that the exterior derivative operator d is also nilpotent. Even though d and d 
are very different, there is a very deep and rich correlation between them that we will be 
exploiting throughout the rest of this series. 

But moving on for now, we point out that the boundary operator acts linearly on 
elements of Cn{K): 

dnC = Cidnan,i (3.2.15) 
i 

And more importantly, notice that in each example above, when dn acts on an n- 
simplex cr„, the result is a linear combination of cr„„i's, which is an element of Cn-i{K). 
So, 

a„ : Cr,{K) Cn-i{K) (3.2.16) 

For example, consider again the 2-dimensional complex K shown here with orientation 
indicated by the arrows: 
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will have two 0-2/3, so an arbitrary element c G C2{K) will be c = Aa2^i + Ba2,2, and its 
boundary will be 



820 = Ad2Cr2,l + -Bf^20"2,2 

= Ad2 {P0P1P2) + B 82 {P1P2P3) 

= A{{piP2) - {P0P2) + (poPl)) + B{{p2P3) - (piPs) + (P1P2)) 

G Ci{K) (3.2.17) 
Also, from the linearity of 

9„(c + c') = Y^ic + c[)dnan, (3.2.18) 

i 

So the group structure of Cn{K) is preserved when mapping down to Cn-i{K) using 9„. 
This means that dn is a homomorphism (cf section 3.1.4[ >. 



All of this means that, for an A^-dimensional simplicial complex K, we can form the 
chain complex, which is the following series of Abelian groups Cn{K) and homomor- 
phism: 

Cn{K) ^ Cn^i{K) Ci{K) ^ Co{K) ^ (3.2.19) 

We denote the chain complex C{K). 

3.2.5 Homology Groups 

Above we defined the Chain Groups, which weren't all that interesting - just multidi- 
mensional additive groups over integers. However the chain complex resulting from the 
boundary operator allows us to create a much more interesting group. 

We can choose a single n-chain c G Cn{K) at any n along the chain complex. It is clear 
from ( 3.2.19[ > that many (but not all) of the c G C„ can be written as the image of something 
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in Cn+i (K) through dn+i, and it is clear that any c G Cn{K) will be mapped to Cn-i{K) by 

dn- 

Consider specifically the set of all chains in c G Cn{K) that are mapped to in 

dnc = (3.2.20) 

Such chains are called n-cycles. We denote the set of all n-cycles in Cn{K) as Zn{K). 
Zn{K) is a subgroup of Cn{K): 

1) If c and c' are in Zn{K) ((9„c = (9„c' = 0), then c+c' will be also (c+c' = 9„(c+c') = 0). 

2) Associativity holds trivially. 

3) The n-cycle is a trivial identity element in Zn{K): 9„ = 0. 

4) If c G Zn{K) (so dnC = 0), then trivially the inverse — c satisfies c) = 0. 



Note that Zq{K) contains only 0-chains, all of which trivially map to (cf (3.2.11 1), and 
therefore Zq{K) = Cq{K). 

As we said above, many c G Cn{K) can be written as the image of something in 
Cn+i{K) mapped by dn+i- This is equivalent to saying that many of the n-simplexes in 
Cn{K) are boundaries of (n + l)-simplexes in Cn+i{K). We want to focus on the c G Cn{K) 
that can be written as the image of something in Cn+i{K) through dn+i- Therefore we de- 
fine an n-boundary as the set of all elements c G Cn{K) that can be written as the image 
of something in Cn+i{K). In other words, if c G Cn{K) can be written as 

c = dn+id (3.2.21) 

for some d G Cn+i{K), then c is an n-boundary. We denote the set of all n-boundaries 
Bn{K). This name is natural - if something in Cn+i {K)\s mapped by the boundary operator 
to c G Cn{K), then it must be that c is the boundary of that something. Hence, it is an n- 
boundary. 



Now, from the nilpotency of d as seen in equation (3.2.141, we make an important 
observation - any n-boundary is an n-cycle. If it is possible to write c as dn+id (with d G 
Cn+i{K)), then 9„c = dn{dn+id) = 0. We have therefore proven the intuitive fact that 
boundaries of things don't have boundaries. We mentioned this fact (without proof) in 
section |3.2.1[ 

However the converse is not necessarily true - if an n-chain c G Cn{K) is an n-cycle, 
that doesn't necessarily meant hat it is an n-boundary. Recall that in section 3.2.1 we 
also mentioned that our interest is in things that do not have boundaries, but are not 
themselves boundaries of anything. In other words, we are interested in chains that are 
elements of Zn{K) but are not in Bn{K). We therefore need some way of looking at Zn{K) 
without Bn{K) "getting in the way". It is now apparent why we have introduced this 
(somewhat boring) chain group structure on K - group theory has a tool which gives us 
this exactly - the factor group (see [40] For a review of this idea). Recall that the idea 
behind a factor group G/ H is that, when if is a normal subgroup of G, then G/H is the 

^Mathematically, given a map / from group Gi to group G2, we call the set of all elements of Gi which 
are mapped by / to the identity element of G2 the kernal of /. We will employ this notation later in these 
notes (and explain it in greater depth), but we are mentioning it now so it's not new later. 
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set of all elements in G with all of H "shrunk" to the identity element. In other words, 
G/H is G with H "divided out". 

Here, we have the group Zn{K) and a subgroup Bn{K): 



B„{K) C ZrXK) 



(3.2.22) 



(which is automatically a normal subgroup because these groups are all Abelian), because 
all boundaries are cycles. Therefore, if we want to look at things that are cycles but not 
boundaries, we can take the factor group Zn{K)/ Bn{K), and we are done! This will give 
us the answer we wanted in section |3.2.11 

We therefore define the n^^ Homology Group, denoted Hn{K), as 



H^{K) = Zr,{K)/B^{K) 



(3.2.23) 



Once again, the idea behind "what this means" is that Zn{K) is the set of all things that 
are mapped to by the boundary operator - in other words the set of all things that 
do not have boundaries, and therefore could be the boundary of somethingj^ We know 
from ( 3.2.14[ > that boundaries of things do not have boundaries. Therefore we can think 
of Zn{K) as effectually finding boundaries for us. However, among things that could he 
boundaries, we want to differentiate between things that are the boundaries of something 
and things that are not. For example 




is a collection of 1-simplexes that could be a boundary if the center was filled in originally, 
or could not be a boundary if the triangle wasn't filled in. Topologically, the more inter- 
esting situation is when the triangle is not filled in (because then we have a non trivial 

^"Recalling that something that could be the boundary of something is anything that doesn't have a 
boimdary itself. 
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loop), and therefore we are more interested in the situation in which the thing that could 
be a boundary is not actually a boundary. In other words, we want something that will 
"alert" us when 




Therefore we use the factor group Zn{K) / Bn{K), which takes the set of all things that 
could be boundaries {Zn{K)), and "collapses" the set of all things that are boundaries 
{Bn{K)) to the identity, because they are topologically trivial anyway (like the solid trian- 
gle above). 

3.2.6 Fundamental Theorem of Homomorphisms 

Before moving on, we mention an extremely important relationship that will help 
simplify calculations of Homology groups. Consider the general situation of a homo- 
morphism / : Gi — > €2- If we take G2 to be a normal subgroup of Gi, then to study 
homology groups, the group we are interested in is the factor group Gi/ker{f), where 



ker{f) is the kernal of / as introduced in the footnote on page 129 which is Bn{K). 



The Fundamental Theorem of Homomorphisms, which we will not prove, says that 

Gi/ker{f) = im{f) (3.2.24) 

where im(f) is the image of / in G2. In other words, im(f) = f{Gi). The equals sign in 
this case means "isomorphic". 

As an example, consider / : Z — > Z2, where 

/(a) = 0, a even 

f{a) = 1, a odd (3.2.25) 

This is clearly a homomorphism (but not an isomorphism). The kernal of / will be every- 
thing that maps to the identity in Z2, which in this case is 0. So, ker{f) = 2Z. The image 
of /, or /(Z), is {0, 1}. So, the fundamental theorem of homomorphisms tells us 

Z/ker{f) = Z/2Z = zm(/) = {0, 1} = Z2 (3.2.26) 

which is what we found in ||40| . This makes sense - the set of all integers mod the even 
integers (Z/2Z) leaves only two possibilities - even or odd. This is a group with two 
elements, or 1 - hence Z2. 



We will make great use of (3.2.241 in calculating homology groups. For example, from 



(3.2.23) {Hn{K) = Zn{K)/ Bn{K)), if we can express Bn{K) as the kernal of some map /, 



then 



H^{K) = Zr,{K)/Bn{K) = Z^{K)/ker{f) = im{f) (3.2.27) 



which is generally much easier to calculate. 

But what type of function must / be? First of all, it must be a homomorphism, or else 



(3.2.24) doesn't apply. Second, for (3.2.24) to apply, / must map from Zn{K) to a normal 
subgroup of Zn{K) (and because Zn{K) is Abelian, any subgroup is normal). The trick is 
to make sure that if c G Bn{K) C Zn{K), then /(c) = 0. We will discuss how this is done 
in the examples below. 
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3.2.7 Examples of Homology Groups 



Consider the 0-simplex K = (po)- For this we have Co{K) consisting of all chains c of 
the form co(po) with cq e Z. This is therefore isomorphic to the group Z with addition. 
Obviously doc — for any c e Co{K), so Zq{K) — Co{K). Then, because there is no Ci{K) 
in this case, we take Bo{K) to be - the identity element. So, we have 

Ho{K) = ZoiK)/Bo{K) = Co(X)/{0} = Z/{0} = Z (3.2.28) 

It is easy to show that Hn{K) = for n > (we let you work this trivial result out 
yourself). 

Now consider the complex X = {(po), (pi), (P2), (poPi), (P1P2), (P2P0), (P0P1P2)}, a solid 
filled-in triangle 




Co{K) will consist of elements c such that 

c = A{po) + B{p,)+C{p2) (3.2.29) 

where A,B,C e Z, and therefore Co(i^) = Z © Z © Z. To find Zo{K) we find all of the 
0-cycles: 

do{A{po) + B{pi) + C{p2)) = (3.2.30) 

and therefore Zq{K) = Cq{K) = Then, Bq{K) will be the set of all chains that 

satisfy 

d2{D{poPi) + E{pip2) + F{p2Po)) 

= ^((Pi) - {Po)) + E{{p2) - (pi)) + F((po) - (P2)) 

- (F - D){p,) + {D- E){p,) + {E- F){p2) 

= D'{po) + E'{p,) + F'{p2) (3.2.31) 
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where D,E, F G Z. The first line represents the boundary operator 82 acting on an ar- 
bitrary 2-chatn, and the third line is the image of 82- So, any c G Co{K) of the form in 
the third line here will be a 0-boundary, and therefore in Bq{K). Notice that, as we have 
redefined the coefficients of the 0-chatns, they satisfy 



- D' -E' = F' F' + E' + D' = 

So, solving for F', an arbitrary element of Bq(K) will be of the form 

D'{po) + E'{p,)-{D' + E'){p2) 



(3.2.32) 



(3.2.33) 



Taking Hq{K) = Zq{K) / Bq{K) directly as we did above is not an obvious calculation. 
Therefore we use the fundamental theorem of homomorphisms ( 3.2.24[ >. As we discussed 
at the end of section 3.2.6 we want a homomorphism / which maps Zq{K) to a normal 
subgroup of Zq{K) such that the kemal of / is Bn{K). 

The way we should construct / is revealed by (3.2.321. We know that Zq{K) = Co{K) 
is isomorphic to Z © Z © Z by equation (3.2.30) ff. Bq{K) also appears this way in the 



last line of (3.2.31). However Bq{K) actually has the addition constraint of (3.2.321. And, 
because ( 3.2.32| ) is a constraint over Z (each term in (3.2.32) is an element of Z), we can 
define / : Zq{K) — > Z as 



f{A{po) + B{p,) + C{p2)) = A + B + C 



(3.2.34) 



Now any element of Zq(K) that is mapped to zero (in other words, is in the kernal of /) 
will satisfy 



A+B+C=0 



C = -iA + B) 



(3.2.35) 



which defines exactly an element of Bo{K) (cf equation ( 3.2.33[ )). 
So, 

Ho{K) = Zo{K)/Bo{K) = Z^{K)/ker{f) = tm{f) = Z 



(3.2.36) 



(the image of / is clearly seen to be Z from (3.2.34)). 

Next we want to find Hi{K). First we find Zi{K) - f or c G Ci{K), 

dic = di{A{popi) + B{pip2) + C{p2Po)) 



{C - A){po) + iA- B){p,) + (5 - C){p2) 



(3.2.37) 



(we borrowed the result from ( 3.2.31[ )) where, as usual, A,B,C G Z. This will be only in 
the case 



A = B = C 
Therefore, a general element of Zi{K) is 

^((PoPi) + (P1P2) + (P2P0)) G Zi{K) 



(3.2.38) 



(3.2.39) 
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Now to find Bi (K). This will be an element that is the boundary of a two simplex: 

d2D{poPlP2) 



D{{piP2) - {P0P2) + {poPi)) 
D{{poPi) + {P1P2) + {P2P0)) 



(3.2.40) 



where D E Z. Comparing (3.2.39) and (3.2.40) we see that they are the same. Therefore 

B,iK) = Z,{K) (3.2.41) 

which means 

H^{K) = Zi{K)/Bi{K) = (3.2.42) 

where is the identity element in the group. 

We leave it to you to show that Hn(K) = for n > in this case. 

As a third example, consider the complex K = {{po), (pi), (^2), (poPi), (P1P2), {P2P0)}, a 
triangle without the inside filled in. 




An identical argument to the one above gives 

Ho{K) = Z (3.2.43) 

for this complex. 

Hi{K), however, will be different. First, Zi{K) will be the same as in (3.2.371, (3.2.381, 
and (3.2.39), and therefore isomorphic to Z. But unlike before, Bi{K) = here because 
there is no 2-simplex to take the boundary of as in (3.2.40). So, 

Hi{K) = Zo{K)/{0} = Zo{K) = Z (3.2.44) 
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As a few final examples which we leave to you to work out, the triangulation 
of (an empty tetrahedron), or K = {(po), (pi), (P2), (pa), (poPi), (P0P2), (poPs), (P1P2), 
(PiPa), (P2P3), (P0P1P2), (P0P1P3), (P1P2P3)}, will give 

Ho(K) = Z Hi(K)^0 H2(K) = Z (3.2.45) 

and all higher groups Hn{K) = 0. 

A triangulation of the Mobius strip (you can write out K yourself) produces 

Ho{K)^Z Hi(K)^Z H2{K)^0 etc. (3.2.46) 

The torus produces 

Ho{T^) = Z Hi{K) ^Z®Z H2{T^) = Z (3.2.47) 

The cylinder C produces 

Ho{C) - Z Hi{C) - Z Hn{C) = 0, Vn > 1 (3.2.48) 

A triangulation of the figure eight: 




will be 

Ho{K) = Z Hi{K) ^Z®Z H^{K) = 0, Vn > 1 (3.2.49) 
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3.2.8 Meaning of the n^^ Homology Group 

As we explained in the introduction to this section, the basic idea of homology is to 
find something that could he a boundary, but isn't one - this indicates a hole. The simplest 
example is two distinct points - they could be the boundary of a line. So for a line, the two 
points on the ends both could be and are boundary points. However, two disjoint points 
with no line between them could be and are not boundary points. 

This is the idea behind homology groups. The group Hn{K) looks for "n-dimensional 
holes" in a space by taking all of the things that could be boundaries {Zn{K)) and mod- 
ing out all the things that actually are {Bn{K)). What is left, Hn{K), is a record of the 
n-dimensional holes. 

For example, a one-dimensional hole is a circle S^. Therefore any space with a one- 
dimensional hole will have a non-trivial Hi{K). A two-dimensional hole is S"^, and there- 
fore any space with a two-dimensional hole will have a non-trivial H2{K) (cf equations 
(|3;Z45])-(|3i49|)). 



Another way of thinking about this is as follows: for a one-dimensional hole, which 
is like S^, imagine a circle living in the space, but it will move itself around to try to 
collapse itself down to a point. If the space is such that there is some way a loop can live 
in it without being able to find a way of contracting itself down, then the Hi homology 
group is non-trivial. Obviously , the Mobius strip, and the figure eight all satisfy this 
(the figure eight had two ways of "supporting" an in this way), whereas the other 
examples considered above did not. 

Of the spaces we considered as examples above, only the tetrahedron (which was 
homeomorphic to S"^ itself) had a non-trivial H2 group. Once again this makes intuitive 
sense. 

The Hq groups are less obvious. In essence Hq{K) will be non-trivial whenever the 
space can separate two individual points (5*°). All that is required for this is for the space 
K to contain a single point. Therefore Hq{%) = 0, but for any K ^ 0, Hq{K) 7^ 0. In fact 
the following result can be proven: 

Hq{K) = Z (3.2.50) 
for any connected complex K. If K is not connected, then 

Ho{K) = Z®Z®---®Zptimes (3.2.51) 

where p is the number of disconnected components of K. For example, if K is three 
disjoint points, 

Ho{K) = Z®Z®Z (3.2.52) 
For the general Hn{K) group, the basic form will be 

Hn{K) = Z®Z®---®ZpUmes (3.2.53) 

where now p is the number of n-dimensional holes in 

As a note to more mathematically inclined readers, we are omitting discussion of the torsion subgroups 
for now. 
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We recognize that this discussion has left a tremendous amount to be desired, and 
anyone already familiar with the topic is likely pulling their hair out due to the lack of 
rigor and often somewhat careless treatment of concepts. We apologize for this, but want 
to reiterate that our only goal is to give the reader a small picture of the basic ideas behind 
these topics in algebraic topology. Not a comprehensive (or even close to comprehensive) 
introduction. We have done our best to maintain (pedagogical) clarity even if that has 
resulted in a lack of completeness. We will address these ideas again later, after we have 
developed more sophisticated concepts, tools, techniques, and vocabulary. 

3.2.9 Connection Between Homology and Homotopy 

Before moving on we take a moment to consider how homotopy groups and homol- 
ogy groups relate. You no doubt noticed that there is some similarity between them. To 
emphasize this we summarize several of our results so far in the following table. 





7ri(X) 




Disk 


1 


1 


Circle 


Z 


Z 


Torus 


zez 


zez 


Cylinder 


z 


z 


2-Sphere 


1 


1 


Mobius Strip 


z 


z 


Figure Eight 


Generated by gi, g2 (non-Abelian) 


z©z 



Notice that, in all cases, the fundamental group (tti) and the first homology group (Hi) 
is the same except for the figure eight. We will address the case of the figure eight shortly. 

For the other six spaces, the reason for the similarity is that 7r„ and Hn are essentially 
looking for the same thing - n-dimensional "holes" in the spaces (for clarity, we will fo- 
cus on TTn and for n > only. In the cases above, we are specifically considering 
1 -dimensional holes, which are equivalent to circles. It is clear by inspection that a disk 
and a 2-sphere do not have any holes such that a circle cannot be retracted to a point. This 
is why both vri and Hi are trivial for each of them. 

On the other hand, the circle, the cylinder, and the Mobius strip all have a single loop 
in them which would prevent a circle from being contracted, and therefore tti = Hi for 
all of them. 

The torus, as we pointed out above, has two holes in it, and therefore tti and Hi both 

have two copies of Z. 

You can effectively think of the Z's in each corresponding to a hole in the sense of the 
number of times you can wrap around that hole. 

Higher homotopy groups (7r„) and homology groups (if„) are similar. They are both 
looking for the same things. The difference is the group structure they assign based on 
what they find. 

So what about the figure-eight? Why are tti and Hi different in this case? Furthermore, 
notice that Hi for the figure-eight is equal to Hi for the torus. Yet, while tti for the torus is 
equal to Hi for the torus, tti for the figure-eight is unique. 
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The difference is in the nature of the groups generated. As we mentioned above, the 
general form of Hn{K) is (3.2.531. We can classify such groups more generally. 

Consider some group G with elements gi. We will take the group operation to be ^. Let 
us assume that G is a group such that there exists a finite set of n elements hi, . . . ,hn G G 
which act as generators of the group. This means that any element can be written as 
some product of these n elements: 



9i = h\'h 



«1 /,«2 



hi 



(3.2.54) 



where hj = hi -k hi -k ■ ■ ■ -k hi, j times. We briefly discussed this when we talked about 



the homotopy group of the figure eight on page 106 There there exist a finite set of such 



generators, we say that G is finitely generated. If the generators are linearly independent, 
we say they are free generators. If the generators are linearly independent and free, then 
G is necessarily Abelian, and we call G a finitely generated free Abelian group. The 
number of generators n is called the rank of G. In simpler terms, if G is finitely generated 
by n linearly independent generators, we call G a free Abelian group or rank n. 

For example, consider the group Z © Z with addition. An arbitrary element can be 
written as (a, h), where a,b G Z. This group is obviously infinite, but we can write any 
element in terms of the generators 



hi = {1,0) /l2(0,l) 
Then, the group element (a, b) is written as 

{a,b) = hlhl 



(3.2.55) 



(3.2.56) 



(recall what the exponents mean here, cf (3.2.54) ff, and that the group operation -k here is 
merely addition, +). 

So, as the chart above indicates, i/i (figure eight) = Z © Z. This is this group exactly 
- the free Abelian group of rank 2. We pointed out above than any arbitrary homology 
group can be written as in ( 3.2.53[ >, which are all free Abelian groups, where the rank (the 
number of generators) is equal to the number of holes. So, all homology groups are free 
Abelian groups. 

The reason for this is that the definition of the n*'^ homology group Hn{K) is given 
in terms of Zn{K) and Bn{K) (see (3.2.231). Notice that, by definition, Zn{K) and Bn{K) 
are both free Abelian groups. Therefore, as the factor group of a free Abelian group, it is 
obvious why Hn{K) is a free Abelian group. 



Furthermore, notice from the table above (and from section 3.1.31 that tti (figure eight) 



is the non-Abelian group with two generators. The reason for this is again because of the 
nature of the definition. Homology groups are defined by free Abelian groups {Zn{K) and 



Bn{K)), whereas homotopy groups are defined by loops, which (as we saw in section 3.1.3 1, 
aren't always commutative. Therefore it is possible to have non-Abelian fundamental 
groups. The figure-eight is only one example of a space with a non-Abelian fundamental 
group. Of course, the torus is an example of a space with a fundamental group that is 
"bigger" than Z, but is still Abelian. 
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As a final comment, there is a general relationship between tti{K) and Hi{K) where, if 
you have tti, it is straightforward to calculate Hi. First define the commutator subgroup, 
denoted F of a group as the set of all elements of the form 

aba-^b-^ (3.2.57) 
If we take the (generally non-Abelian) group tti (K) and then find the factor group 

iTi{K)/F (3.2.58) 
then we have Abelianized ni(K). Recall that the meaning of the factor group G/H is to 



contract everything in H to the identity. So, (3.2.58 1 is setting everything in the commuta- 



tor subgroup equal to the identity, or 1. In other words, we are making every element 
commute]^ And if every element commutes, we are "Abelianiztng" the non-Abelian 
group TTi. 

For example the fundamental group of the figure-eight was the non-Abelian group 
with two generators. If we call this group G and the commutator subgroup F, then we 
are effectually taking everything that doesn't commute in tti and making it commute. 
What will be left is an Abelian group with two generators, which is naturally written as 

Z©Z. 

More generally, for any fundamental group ni(K) and its commutator subgroup 
Ftti{k)> we have the relation 

ni{K)/F^,^K) = Hi{K) (3.2.59) 

We have been extremely cursory in this section. If you don't feel confident about the 
ideas we have discussed here, don't worry. We will readdress everything again in greater 
detail when we need it. We are only introducing these ideas here for completion. 

3.2.10 The Euler Characteristic and Betti Numbers 



There are two final ideas we need to discuss in this section, and we will do so very 
briefly. Before bringing homology into focus here, we first introduce a related idea, the 
Euler characteristic of a space. 

For any simplicial complex K, with simplexes cr„, we can define the number function 
of simplexes such that N(an) equals the number of n-complexes K. We can then define 
the Euler Characteristic, also called the Euler Number, of the space as 

oo 

X{K) = ^(-l)W(a.) (3.2.60) 

i=0 

For example consider the filled in triangle 

^^Don't be confused by the difference in notation between 1 for the identity and for the identity. We are 
speaking in the abstract here. Contracting F "to the identity" simply means that we are making everything 
commute. 
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there are N{ao) = 3 zero-simplexes, N{ai) — 3 one-simplexes, and N{a2) — 1 two- 
simplexes. So, the Euler characteristic is 

oo 

X(triangle) = J](-l)*7V(aO 

i=0 

= N{ao) - N{ai) + N{a2) - N{as) + N{a^) - N{a5) + ■■■ 
= 3-3 + 1-0 + 0-0 + --- 

= 1 (3.2.61) 

How you triangulate a given space doesn't affect its Euler characteristic. The triangle 
above is a triangulation of a disk. We could also triangulate the disk as 
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Now, 



X(triangle) 



Or, a triangulation of a circle: 



N{ao) - N{ai) + N{a2) 

6-9 + 4 

1 




(3.2.62) 



will be 

x(^') = 3-3 = 
Or, another triangulation of a circle 
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(3.2.63) 



will give 



12 - 12 = 



(3.2.64) 



It turns out that the Euler characteristic of a space, which is inherent to the space itself 
and not to how you triangulate it (in other words, the Euler characteristic of a disk is 
always 1, and the Euler characteristic of a circle is always 0), is a topological invariant. 

And, it turns out that the Euler characteristic is tied to the homology groups. To see 
this relationship, we define the Betti Number bn{K) of a space K as the dimension of the 
homology group. In other words, hn{K) is the rank of the free Abelian group Hn{K). 



n 



th 



In other, other words, the Betti number bn{K) is the number of topologically non- trivial 
n-dimensional spaces in K (hi is the number of non-trivial loops, 62 is the number of 
non-trivial ^^'s, etc.). 



For example, in the examples in section 3.2.7 we can make the following table 





Ho 


Hi 


H2 




hi 


h2 


Point 


Z 








1 








Disk 


Z 








1 








Circle 


z 


z 





1 


1 





Sphere 


z 





z 


1 





1 


Mobius Strip 


z 


z 





1 


1 





Torus 


z 


z®z 


z 


1 


2 


1 


Cylinder 


z 


z 





1 


1 





Figure Eight 


z 


z©z 





1 


2 






The relationship between the homology groups and the Euler characteristic is given 
by the Euler-Poincare Theorem, which states 



(3.2.65) 



i=0 
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In other words, the Euler characteristic is equal to both the alternating sum of the number 
of simplexes, and the alternating sum of the Betti numbers. 

While there is drastically more we could say about homology, we will stop here for 
now. We will address these ideas again later in this series when we have more mathe- 
matical "machinery" to work with. For now we trust you have the basic ideas of what 
homology is, and we move on to our final topological idea, cohomology. 

3.3 Cohomology 

3.3.1 Introduction to Cohomology 

So far in this chapter we have discussed homotopy and homology. We saw that they 
effectively "looked for" the same things - ri-dimensional holes. The difference between 
them is the particular group structure they assign to the holes they find. Homotopy as- 
signed (generally) non-Abelian groups, while homology assigned free Abelian groups. 

We discussed several examples of both homotopy groups (especially fundamental 
groups) and homology groups for a variety of simple spaces. As we mentioned several 
times above, calculating general homotopy and homology groups can be extraordinary 
difficult. Imagine trying to triangulate or picture loops in an 11 -dimensional space! There 
are a number of tools and techniques which allow for the calculation of homology and 
homotopy for more complicated (and interesting) spaces, but we don't have the math we 
need to consider them at this point. Our only goal here has been to introduce the basic 
ideas. 

But, in introducing these basic ideas, it has no doubt crossed your mind more than 
once that what we have been doing in this chapter seems to have little (if any) correlation 
to what we did previously with differentiable manifolds and all of the various operations 
on them. This observation is true on the surface - there is no obvious correlation between 
the topological considerations of this chapter and the geometric /analytic considerations 
of the previous. 

However, this is not ultimately true. The ideas contained in this chapter and in the 
previous are actually related in extremely profound, rich, and deep ways. We are now 
in a position to see this correlation. Admittedly, nothing we have been doing with the 
(global) topological properties of the spaces we've considered in this chapter have looked 
like the geometric and analytic tools from above - we haven't been looking at forms or 
pullbacks or vector fields when considering loops and free Abelian groups. This has been 
part of the reason that no connection is apparent. 

Cohomology provides the bridge between chapters |2] and |3j We will see that, just as 
the name implies, cohomology is indeed dual to homology, just as a covector is dual to 
a vector. This section will not only begin to look familiar, but also begin to tie together 
everything we have done so far. 

We will approach cohomology from a slightly different perspective than is standard 
in texts. We will build it through direct analogy with homology, hopefully generating the 
ideas in a pedagogically logical way. Once we have built the necessary motivation and 
(hopefully) intuition regarding what cohomology is, we will introduce the more rigorous 
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ideas behind it, as well as discuss how exactly is the "dual" of homology. 

3.3.2 The Nilpotency of d and d 

In the previous section, we saw that the boundary operator, which we defined by 
( |3.2.10[ ), is nilpotent (cf. ( |3.2.14| |): 

(9^ = (3.3.1) 



Also, in section 2.3.7 we also saw that the exterior derivative operator, which we de- 
fined in (2.3.721, is nilpotent as well (cf. (2.3.73)): 

d^ = (3.3.2) 

Recall that 9 is a map 

d : Cn{K) Cn-i{K) (3.3.3) 

And that is a map 

d : A^'TpM — > A''+%M (3.3.4) 

In both cases we have a nilpotent operator that moves us from one dimensionality to 
another. 

With the boundary operator d, we considered two types of n-chains c: those that were 
"cycles", meaning that they had no boundaries: dc = (denoted Zn{K)), and those that 
were "boundaries", meaning that they could be written as the boundary of an (n + 1)- 
object: dd = c (denoted Bn{K)). We then looked for all of the cycles that were not bound- 
aries, Zr,{K)/Bn{K). 

What made this work was specifically the nilpotency of d, which ensured that 
Bn{K) C Zn{K). In other words, if a chain c is a boundary, then c = dd, and there- 
fore dc = d{dd) = d^d = 0. We were looking at the things in Zn{K) {dc = 0) that are not in 
Bn{K) {dc = 0, but not because c = dd). 

It is natural to do the exact same thing with d. 

3.3.3 Cohomology Groups 

We will build up the definition of a cohomology group by "mere analogy" with ho- 
mology groups. We will then go back and explain what exactly they are/ mean. 

Consider the set of all n-forms ujn on some manifold M.. We can define the n-Chain 
Group, which we denote C"(A^), as the additive group over M with elements 

c = ^ dun^i (3.3.5) 

i 

where c* G M and the i subscript on c<j„ is merely an index labeling a particular n-form. 



The linearity and group construction follows exactly from section 3.2.3 
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Notice that the coefficients c' are now in M, not in Z as with homology. This is simply 
because it is more natural to use integers for the coefficients of simplexes and real num- 
bers for the coefficients of forms (multi-dimensional functions). We are using superscripts 
on the coefficients to emphasize that this will be the dual construct to homology. 

Next we use the exterior derivative in a similar way to how we used d with homol- 
ogy. We say that an n-form tOn on ^A is an n-cycle (also called an closed n-form) if it 
satisfies 



dujn = (3.3.6) 

We denote the set of all closed forms Z" ( ) . We say an n-f orm Un on Ai is an n-boundary 
(also called an exact n-form) if it can be written globall^^as the exterior derivative of an 
(n — l)-form: 

Un = dujn-i (3.3.7) 

We denote the set of all exact forms B^{M). 

Obviously, because of the nilpotency of d, any exact n-form is closed (or, any n- 
boundary is an ri-cycle). In other words, if cUn = dcUn-i, then 

dUn = d{dlUn-l) = CpUJn-1 = (3.3.8) 

and therefore 



S"(X) C Z'^iM) (3.3.9) 



(compare this to (3.2.22 1). Again, we are using superscripts on and 5" intentionally to 
emphasize that these will form the dual of the homology groups. 

Again, because 5" and are Abelian, 5" is automatically a normal subgroup of Z", 
and we have the a natural form of the n*^ Cohomology Group: 

if"(A^) = Z'\M)/B'\M) (3.3.10) 

The definition of the homology groups was a bit easier, because the group structure 
was easier to visualize. Making sense of this will require a bit of review. 

3.3.4 Examples of Cohomology Groups 

As a quick preliminary note, we should point out that H'^{M.) is actually a vector space, 
not really a group. The factor group Z"' / B"^ is a set of equivalence classes on Z", which 
form a vector space. We have assigned an additive group structure, but if" is still formally 
a vector space. However the term "Cohomology Group" is so embedded in the literature 
that any attempt to change it would only be confusing (and pointless). We will therefore 
stick with convention and refer to "cohomology groups". We make this point just for 
clarity. 

^^The requirement that it be written globally is vital. This means that we can write w„ = dujn-i over the 
entire manifold - there is no point where it fails to hold. 
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Even though we don't have at this point a good intuitive understanding of what a 
cohomology group is, we will still calculate a few simple examples of them. The next 
section will discuss in more detail what they mean. 

First consider = M. First we find if°(M). The set i?°(M) has no meaning because 
there are no -1 forms. So H^{R) = Z°(M). The set Z°(R) is the set of all 0-forms / that are 
closed, or df — 0. The only way to have df — is ii f is constant. And the set of all constant 
functions, or 0-forms / will be isomorphic to R. So, = Z°{R) /B^(R) = R/{0} = R. 

It turns out that, as with Ho{K), as long as M. is connected, H^{M) will be isomorphic 
to M. If M. has n connected components, then if°(7W) will be isomorphic to n copies of R, 
or M e R ® • • • e R (n times). 

Next we find H^{R). First, we know that because R is 1 -dimensional, any 1-form on R 
is closed. Furthermore, we can take any 1-form fdx and integrate it: 



so that 



fdx (3.3.11) 



f^dF (3.3.12) 



And therefore all 1 -forms are exact. So, all one forms are closed and all one forms are 
exact. Therefore 

H'^{R) = Z\R)/B\R) = {{)} (3.3.13) 

the identity. It is straightforward to show that if"(]R) = for n > 2. 

As another example, consider M — S^. We can repeat a similar analysis as before and 
find that 

H°{S^) = R 
H\S^) = R 

//"(^i) = {0}, n>2 (3.3.14) 



And for M = S^ 



H%S^) = R 

H\S^) = {0} 

H''{S^) = R 

m{S^) = {0}, n>3 (3.3.15) 



And for M = S^: 



H^S"") = R 
= R 

H^{S^) = {0}, jVO,n (3.3.16) 

And so on. So once again, we see that the n*'^ cohomology group "detects" n- 
dimensional holes in exactly the same way as the n*^ homotopy group or the n*'* ho- 
mology group. Cohomology is quite remarkable - a purely (local) analytic statement - the 
types of forms that can exist onM- giving us (global) topological information about M. 
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3.3.5 Meaning of the n*'* Cohomology Group 



The n*^ homology group H^iK) was defined as H^iK) = Z^{K)/B^{K). We could 
"picture" this as the set of all closed cycles without the set of all boundaries. We thought 
of the factor group Zn{K) / Bn{K) as the set of everything in Zn{K) with everything in 
Bn{K) "contracted to the identity". This picture doesn't work as well for cohomology, 
where Zn{M.) is the space of all n-forms on A^. So to understand this more clearly, we 
briefly review factor groups. 

Consider the Abelian group Z = {0, ±1, ±2, ■ ■ ■ } with addition. We can take the nor- 
mal subgroup of all elements of Z of the form An, where n E Z. We also write this as 
4Z = {0, ±4, ±8, ±12, ■ ■ ■ }. Obviously 4Z obeys closure: 

4n + 4m = 4(m + n) = Ap peZ (3.3.17) 

it is associative, there is an identity (0), and an inverse {{An) + {—An) = 0). 

We can find the factor group Z/4Z by using the definition in [40 J. An arbitrary element 

of Z/4Z is 

n(4Z) =n{0,±4,±8,±12,---} = {n + 0,n±4,n±8,n±12, ■■■} neZ (3.3.18) 
So we can write out: 



0(4Z) 


= {0 + 0,0±4,0±S 


,0±12,- 


■■} = 


{0,±4, ±8,±12,---} 


1(4Z) 


= {1 + 0,1 ±4,1 ±5 


,1±12,- 


■■} = 


{1,5, -3,9, -7,13, -II,---} 


2(4Z) 


= {2 ±0,2 ±4,2 ±5 


,2±12,- 


■■} = 


{2,-2,6,-6,10,-10,14,---} 


3(4Z) 


= {3 ±0,3 ±4,3 ±S 


,3±12,- 


■■} = 


{3,7,-1,11,-5,15,-9,---} 


4(4Z) 


= {4 ±0,4 ±4,4 ±5 


,4±12,- 


■■} = 


{4,0,8,-4,12,-8,16,---} 


etc. 











(3.3.19) 



Comparing the far left and right sides of these, we see that 0(4Z) and 4(4Z) are the 
same. You can continue writing these out and you will find that any element of the form 
(4n)(4Z) will be the same, where n G Z. Furthermore, you will find that 1(4Z) and 5(4Z) 
will be the same, as will any element of the form {An ± 1)(4Z). All elements of the form 
{An ± 2)(4Z) will be the same, and all elements of the form {An ± 3)(4Z) will be the same. 
In other words, the factor group Z/4Z breaks Z up into four equivalence classes. There is 
the equivalence class represented by 0, which we can denote [0], by 1 (denoted [1]), by 2 
(denoted [2]), and 3 (denoted [3]). This group is isomorphic to Z4, the integers mod 4. 

You can write this out for any subgroup nZ C Z and find that it breaks Z into n 
equivalence classes. 

You could also take the group M under addition with the normal subgroup Z, and 
the factor group M/Z will break every element of Z into equivalence classes. It would 
be instructive to write this out and see that there are an infinite number of equivalence 
classes in this case, all parameterized by the real numbers from the interval [0, 1). An 
element a E [0, 1) will be equivalent to any element a + n where n E Z. We can reword 
this to say that two elements a,b E Z are equivalent in M/Z if a — 6 G Z. 
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Notice that the same can be said of Z/4Z - two elements a,b E Z are equivalent in 

Z/4Z if a-6 G 4Z. 

We can generalize this to arbitrary (additive) groups. For any group G with normal 
subgroup H, two elements gi, gj E G are equivalent in G/H ii a — b E H. This definition 
of G/if is equivalent to the ones given before. 

This way of thinking about factor groups, where the subgroup H defines an equiva- 
lence class of elements of G, and these equivalence classes are the elements oi G/ H, will 
be the most useful way of thinking about cohomology groups. 



So, looking at the definition of the n*^ cohomology group ( |3.3.10[ >, we have the group of 
all closed forms Z^{M.) and the normal subgroup B'^{M.) of all exact forms. And while 
an exact form is necessarily closed, a closed form is not necessarily exact. This is what 
H^{M.) is measuring - it is finding all forms that are closed and "collapsing" the ones that 
are exact to the identity. 

Or in other words, B^\M.) creates an equivalence class of forms in where two 

chains c, c' E Z^'^M) are equivalent in Z'\M)/B''{M) if (c - c') E B^'iM). We can speak 
in terms of forms (rather than linear combinations of forms, or chains), and say that two 
forms are equivalent in Z^ (Ai) / B"■{^A) if they differ by an exact form. 

So, Un and Un + du'n_^ are equivalenl|^ in Z'^{M.). An important consequence of this 
(as we will see) is that the exterior derivatives of two equivalent forms are equal: 

= Un + duj'^_-^ =^ du'^ = d{ujn + duj'n_i) 



n 



= duJn + d^uj'^_^ 

= dun (3.3.20) 

If two forms differ by an exact form in this way, they are called cohomologous. 
Incidentally, we could have thought of homology in this way. The factor group 

Hr,{K) = Zr,{K)/Br,{K) (3.3.21) 

divides Zn{K) up into similar equivalence classes. Two n-simplexes cr„ and a'^ are related 
such that their difference is an [n + l)-boundary 

< = a„ + (3.3.22) 

then cr^ and cr„ are homologous. Obviously the boundary of cr" and cr„ are the same: 

= dnCTn (3.3.23) 

However, unlike with the cohomology case, we can draw a picture of what is happening 



in (3.3.23). Let a" and (T„ each be standard 2-simplexes (triangles). 



14' 



The prime does not indicate any sort of derivative - it is only notational. 
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and let cr'n^i be a standard 3-simplex (a filled in tetrahedron). Then, a[[ will again just be 
the triangle, while cr^ + dn+icr'n_^_i will be a triangle with the boundary of a tetrahedron 
attached: 




Clearly the "empty tetrahedron" attached to (T„ + dn+icr'^^i contributes nothing to its 
boundary, and the only boundary of (T„ + dn+icr'^^i is that of the triangle. Therefore 

dna'^ = dnan- 

We don't mean to get sidetracked with this rehashed discussion of homology. We 
are merely trying to give some intuition regarding what we are doing with the exterior 
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derivatives and cohomologous forms. We know from the previous section that if" detects 

"holes" in - the question is 'how?' 

To see this more explicitly, consider the manifold M = M.'^, and the form 



First of all 



uj - -^^dx + -^^dy (3.3.24) 
x"^ + + 



^ (^^^d..d.x+^(- ^ 



x"^ + y^ — 2|/^ 



(2;2 _|_ _ 2x^\ 



(x^ + y'^y 

y ^ dxdy - ( (^2 + ^^2)2 ) ^^^^ 



= (3.3.25) 
(notice the order of the differentials in each line) So uj is closed. Also, consider the function 

=tan-^ 1^^^ (3.3.26) 

It is straightforward to show 

uj = dF (3.3.27) 

So we are tempted to say that u is also exact. However F is not defined on all of = R^, 
and we mentioned above that "exact" means that it must be globally expressible as the 
exterior derivative of another form. Here, it breaks down at a; = 0. 

But consider M — — {0}. We can now repeat the same thing as before and we 
find that F{x, y) is well-defined on all of M. So, we have a form that is closed on R^ and 
closed on R^ — {0}, but is only exact on R^ — {0}. This is a simple example of how H'^{M) 
is able to detect holes in 7W . When the origin is included u is not exact because it can't be 
globally written in terms of F. When the origin is not included it can be written globally 
as dF. So we see that the differential structure has given us topological information. 

Once again, there is much more we could say about cohomology and how cohomology 
groups can be calculated, what they mean, etc. However we postpone such discussions 
until later in this series. For now, understanding the basic ideas - that the n*'* cohomology 
group H^{M.) detects n-dimensional holes in - is all that is necessary. We will dive 
back into this once we have the necessary mathematical machinery. 

3.3.6 Cohomology, Integration, and Stokes Theorem 

To introduce this section we point out an important difference between homology and 
cohomology. Namely, notice that homology deals with spaces, while cohomology relates 
to forms defined on spaces. Rewording this, cohomology relates to what you integrate, while 
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homology relates to what you integrate over. We will see that this fact is what makes 
homology and cohomology dual ideas. 

To understand this relationship we need to do some preliminary work. We define the 
standard n-simplex as a simplex in {not ]R"+^ as before) with with the points 

Po = {0,0,...,0} 
pi = {l,0,...,0} 



p„ = {0,0,...,l} (3.3.28) 
In other words, the standard n-simplex (denoted (t„) is 



(Jr. 



|(x^a;^...,x") e W\x' > 0, ^a;' < l| (3.3.29) 



(We will generally drop the word "standard" when talking about standard simplexes, 
and use the term "simplex" and "standard simplex" interchangeably) Compare this ex- 
pression carefully to ( 3.2. 1[ > especially the last "summation" constraint at the end. There 



the definition put a 0-simplex at the number 1 on the real line, the 1-simplex "diagonal" 
in M^, the 2-simplex in M^, etc. 

This definition is slightly different - the simplex is defined in ]R° - the point. The 
1-simplex is defined in M}, the 2-simplex is defined in M^, the 3-simplex is defined in M'^: 




Notice that in each case the general shape is the same - they are just situated in the Eu- 
clidian space differently. 

It is natural to think of these simplexes as differentiable manifolds. They have a 
boundary, which we haven't discussed yet, but that won't be a problem for us. The point 
of thinking of them as manifolds is that we spent the last chapter talking about how to 
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define things on manifolds - vectors, forms, integration, etc. We are only thinking of 
these are manifolds for continuity with that chapter - so that we can define integrals on 
them. Obviously the coordinate functions for these simplexes are trivial because we have 
defined them in M" - the coordinates are simply the coordinates we defined them in. 

It should be clear from the definition that (t„ is an orientable manifold. So it is natural 
to define the volume form (cf section 2.3.8) 

(l)v = hdx^ Adx^ A--- A dx" (3.3.30) 

on them. The function h depends on the coordinates chosen. We can then integrate the 
volume form over the simplex: 

/ 0y = / hdx^ Adx^ A--- Adx'' (3.3.31) 

in the same way as as in section 2.3.8 We used (7„ to describe both the manifold on the left 
hand side and the "open subset of M"" on the right hand side simply because we defined 
the manifold in M" to begin with. 

Note that if we use Cartesian coordinates, the form of this integral will be (from equa- 
tion ( |3.3.29| » 

hdx^ Adx"^ A-- - A dx"" = dx^ dx"^ / dx^ ■ ■ ■ 



(T„ Jo 



dx" h (3.3.32) 



So, we have n-simplexes and volume forms defined on them. Now we want to "put" 
these on some manifold Ai. We therefore define a map 

f -.an — >M (3.3.33) 

Just as (T„ is called an n-simplex in M", the image f{an) G 7W is called the singular n- 
simplex in Ai. We denote this image 

f{an) = Sn (3.3.34) 

So Sn is a singular simplex in A^. 

Now, the standard operations on a simplex o-„ are defined on . For example we can 
take the boundary of s„ as 

dnSn = f{dn^n) (3.3.35) 

Obviously we again have dn{dn+iSn+i) = 0. 

In this way, we can define chains, cycles, and boundaries in the exact same way as 
in the previous section, allowing us to define the singular homology groups, which are 
exactly isomorphic to the simplicial homology groups, and we therefore won't pursue 
them further (remember we are not really interested in homology in this section - we are 
doing all of this merely to prepare us for more discussion of cohomology). 
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What this construction allows us to do is define the integration of some form ujn on tVI 
over a chain s„ on A^: 



(3.3.36) 



where f{an) = Sn and f* is the pullback of / (cf section 2.3.6). For a general n-chain 
c = we write 



(3.3.37) 



Then, using (3.3.32 1 to write out the general form of the integral, we can write this as 



dx 



dx^ 



^^^^ 



dx^ rM 



dx^ 



dx^ 



l-x^-x^ 



dx^ 



(3.3.38) 



where V'n = f*{^n) is merely the n-form ujn pulled back to the subset of M" where the 
standard simplex (t„ sits. 

Now let's consider the situation in which i/j^ is exact - it can be written as the exterior 
derivative of an (n — l)-form an-i- 



If we write an-i out in components, we have 



l2,...,ln-l 



dx'^ A dx'^ A ■ ■ ■ A dx'''- 



(3.3.39) 



(3.3.40) 



However, due to antisymmetry ajj,j2,- -,in-i ^i^l only have n—l independent components'^ 
For notational simplicity, we therefore give it a single index i that runs from 1 to n — 1, 
and write it as 



= aidx^ A ■ ■ • A dx^ A ■ ■ ■ A dx^ 



(3.3.41) 



where the hat indicates that the form below it is omitted. The exterior derivative of q;„_i 
is then 



d(y. 

don-i = TT^dx^ A dx^ A ■ ■ ■ A dx' A ■ ■ ■ A rfx" 
ox^ 



(3.3.42) 



^^Remember that ipn is a form in M" and is therefore a volume element. 



154 



We are summing over j, but obviously only a single term will survive (when i 
because of antisymmetry Nonetheless we leave the summation in for now. 
So 



j) 



dx^ A dx'^ A ■ ■ ■ A dx' A ■ ■ ■ A dx""' 



dx^ 



dx A ■ ■ ■ A dx' A ■ ■ ■ A dx"' A dx^ 



-1 ' 



n-l 



dx^ A ■ ■ ■ A dx^ A ■ ■ ■ A dx^ A dx^ 



dx^ 



1 nl-x^ 

' dx^ j dx^ 
Jo 

^fe 



dx^ 







dx"^ I dx^ 

./n OX^ 



JO 



dx ■ ■ ■ 







\n-l 



dx 
Jo 

1 nl-x^ 

dx^ / dx"^ ■ ■ ■ 
Jo Jo 

i_y-i-l k 



(ix* ■ ■ ■ 



i-T.lZlx'^ 



dr'^a] ~ ^p=^ 



dx 



a; 



(3.3.43) 



Notice that, after the discussion immediately following equation (3.3.42 1, we have written 
the bounds in the last line of (3.3.43 1 in terms of instead of x^ as in the previous lines. 

This equation is quite ugly, but we can make some sense of it. First of all, notice that 
in each of the standard simplexes drawn above, an n-dimensional simplex {n > 0) consists 
of several (n — 1) -dimensional simplexes touching the origin, a single (n — 1) -dimensional 
simplex not touching the origin, and all of the n-dimensional space "between" each of 
these filled in. The n = case is special because there are no — 1-dimensional simplexes, 
but the idea still holds. For example, the one-simplex is a 0-dimensional simplex touch- 
ing the origin (located at the origin), another 0-dimensional simplex at the point 1 G M, 
and then the one-dimensional space between them filled in. The two-simplex has two 
1-simplexes at the origin, one along the x-axis and one along the y-axis, a third 1-simplex 
not touching the origin stretching from (0, 1) to (1, 0), and then the 2-dimensional space 
between these three 1-simplexes filled in. 

Also, notice that in each of the standard simplexes, if we set one of the vertex vectors 



(3.3.28 ) equal to 0, the result is an n— 1-simplex. For example, with the standard 2-simplex, 
if we set y = for every point, we "project" the 2-simplex down to the x-axis, resulting in 
a 1-simplex. We could also have set x = resulting in a projection of the 2-simplex to the 
y-axis. We have three ways of projecting the 3-simplex to a 2-simplex: set x = projecting 
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it to the y, z plane, set ?/ = projecting it to the x, z plane, or set 2; = projecting it to the 
X, y plane. 

So in ( 3.3.43[ >, notice that each term in the sum over % is an integral over the entire 



n- 



dimensional simplex, but with the x* coordinate either set to or to one minus the sum 
of all of the other x's. If we look at the term where x* is set to 0, notice that we have 
exactly the "projection" from the n-simplex onto an {n — l)-simplex as in the previous 
paragraph. Therefore ( 3.3.43[ > is a sum over every {n — l)-simplex forming the boundary 



of the n-simplex On the integral was originally over. 

The term with set to one minus the sum of all of the other x's on the other hand will 
be the (n — 1) -simplex that is not touching the origin. 

So, each term in this sum over % will result in each [n — l)-dimensional simplex forming 
the boundary of o"„ being integrated over. The {n — l)-simplexes that touch the origin 
correspond to one of the x's be ing set to 0. This means that The ai component of q;„_i 



(which has no x* component, cf (3.3.41 1) will be zero on the (n — l)-simplex projected onto 
by setting x* = 0. Then, we integrate each component oti over the {n — 1) -dimensional 
simplex that is not attached to the origin, each of which may have a non-zero component 
on that face. 

In other words, the integral /_ (ia„_i is equal to integrating a„_i over the boundary of 
On- This gives us the remarkable result 

dan-\ = / «n-i (3.3.44) 

Notice that the integral on the left is an ra-form integrated over an n-dimensional space, 
whereas the right is an {n — l)-form integrated over an (n — 1) -dimensional space, so the 
integral makes sense on both sides. Equation ( 3.3.44[ > is called Stokes Theorem. It is an 



extraordinarily powerful result, and we will see soon that it is actually very familiar to a 
physics student. 



To illustrate (3.3.441 we consider a two-dimensional example. We'll work in Cartesian 
coordiantes where x^ = x and x^ = y. We will integrate over the 2-simplex 

0-2 = {x,?/ e M^|x,?/ > 0, x + y < 1} (3.3.45) 

We then take ai to bf3 

«! = aidx'^ + a2dx^ (3.3.46) 

The exterior derivative will be 

dai = ^^dx^ A dx* 
ox^ 

dai , n ,1 da2 ,1 , o 
= -TT^dx^ A dx^ + -TT-rdx^ A dx^ 
ox^ ox^ 

= f^-^V^'Arfx^ (3.3.47) 
\ ox^ ox^ J 

^^Note the bad notation - ai is both the name of the one form as well as the 1 component of the one form. 
The context will make it obvious which we are talking about so this won't cause a problem. We just wanted 
to bring it to your attention. 
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Our integral will then be (using the first line of (3.3.47)) 



da I 



0-2 



doi 

dai 
dxi 



-dx^ A dx'' 
dx'' A dx^ 



- dx' A dx^ 



1 



dai 
dxi 



- dx' 



dx 



JO 

1 



2 \x^=l—x'^ 



- / dx ail 1 



x^=0 



dx^ 
Jo JO 

dx aal^.^o 



dx 



dx'^ 



JO 

1 .1 

dx^ai{x^ = 0) + dx^a2{x'' = 0) 
Jo 

/ dx''ai{x^ = 1 — x^) — / dx^a2{x'' = 1 — x^] 
Jo Jo 



(3.3.48) 



The first term is integrating ai up the y-axis at x = and the second is integrating a2 
across the x-axis with y = 0. Notice in ( 3.3.46[ > that ai has no a; -component and a2 has no 
?/-component, and therefore there is no need to integrate 0^2 along the y-axis or ai across 
the X-axis. However, both ai and 02 may be non-zero on the 1-simplex not touching the 
origin (from (1, 0) to (0, 1)), and therefore we integrate both ai and 0:2 across this line in 
the third and fourth term. The minus signs are merely to factor in the directed-ness of the 
simplex. 

So, Stokes theorem has give us a relationship between the integral of an exact form 
u!n = dun-i over some space (t„ and the integral of «„_i over the boundary dan- 

Furthermore, as we said previously, we can map the standard simplexes into a man- 
ifold M. to integrate over a neighborhood of M., and so by using everything we know 
about integration, we can generalize this to integrals of forms on arbitrary manifolds. 

To see how Stokes theorem relates to standard physics, consider the covector u = 
(pxdx + (pydy + (pzdz G M'^. We found the exterior derivative of in equation (2.3.791, 
and recognized it as the curl of lj. If we want to integrate du; = 'V x uj over some two 
dimensional area A with boundary dA = C, Stokes theorem gives us 



V X ll) -dS 



iv-dS 



(3.3.49) 



c 



This is a standard result in Ek,M and is what physicists typically mean when they refer 
to "Stokes theorem". 



Furthermore, if we imitate what we did in (2.3.77), where 

4>2 = 4>x dy A dz + (py dz Adx + 0^ dx A dy 



and 



d(f)2 



dx dy 



dz 



dx Ady A dz 



(3.3.50) 



(3.3.51) 
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which we recognized as the divergence of or V ■ 02, then we can integrate this over 
some three dimensional volume V with boundary dV = S. Stokes theorem gives us 

[ V ■ 02C/V = (f cj)^-dS (3.3.52) 
Jv Js 



Equation (3.3.52) is another standard result in EhM. For example, consider the Maxwell 
equation 

V ■E = ^ (3.3.53) 

where p is the charge density and the constant eo is the permittivity of free space. We can 
integrate both sides over some volume V: 

V ■E = - [ p (3.3.54) 



V ^0 Jv 



The right side is the integral of the volume charge density over a volume and is therefore 
simple the total charge inside the volume, or Q. We can apply Stokes theorem to the left 



side as in (3.3.521, getting 



E-dS = ^ (3.3.55) 

s 

So the total electric charge inside of some space is equal to the integral of the electric 
field across the surface of the space. This extraordinarily powerful result is referred to in 
physics literature as Gauss' Law. 

So, both Gauss' Law (which is typically used to do calculations with electric E fields) 
and (the physical version of) Stokes Law (which is typically used to do calculations with 
magnetic B fields) are both consequences of the same general law - (the mathematical 



version of) Stokes theorem, (3.3.44). 



3.3.7 The Duality of Homology and Cohomology 

As we have stressed previously, homology relates to a space while cohomology relates 
to forms. Put another way, homology relates to what you integrate over, while cohomol- 
ogy relates to what you integrate. 



As we discussed in section 2.1 given some space of objects V, the dual space is the set 
of objects V* which map elements of V to M. We can naturally define this exactly within 
the context of homology and cohomology. Namely, consider a form u. Obviously cu is not 
generally an element of M. But forms are things we integrate, and when we integrate u 
over some space Ai, the result is in M. In other words, spaces are dual to forms through 
integration. 

To make this more precise, consider the manifold Ai with chain group Cn{A4) and set 
of rz-forms A^Ai. If c G Cn{Ai) and cu E A'^Ad, then we define the inner product between 
c and cu as 

{c\cu) = cu eR (3.3.56) 
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This dejfinition is linear in both c and 



J c J c J c 

{c-\-c!\lo) = I UJ = UJ + 

J c+d Jc Jc' 



UJ (3.3.57) 

' c+c' J c J d 

We can write Stokes theorem in the compact form 

{dc\uj) = {c\duj) (3.3.58) 
Then consider the operator Ua which acts on a vector as 

Ud\uj) = \(Lj) (3.3.59) 
Obviously it will be the conjugate of this that acts on the dual space. So, 

{c\duj) = {c\Ud\uj) = {dc\uj) (3.3.60) 

So 

{c\Ud^{dc\ =^ iUay\c) = \dc) (3.3.61) 

So 

{U^y = Ua (3.3.62) 

We will see this relationship in greater detail when we discuss Hodge Theory in the next 
chapter. 

Another obvious consequence of this duality is that 

{c\u) = (3.3.63) 

if either 

1) c G BniM) and lj G or 

2) c G Zr^iM) and G 

Also, recall that i^„(A4) = Z^{M)/Bn{M) and H'^iM) = Z''{M)/B''{M) both consist 
of equivalence classes of Zn{M) and Z^{M), respectively. We can denote the set of all 
chains equivalent to c by [c], and the set of all closed forms equivalent to u by [ou]. So, 
[c] G Hn{M) and [uj] G H"-{M), and we have the natural inner product 

{[c]\[uj]) = Ju (3.3.64) 

And because of Stokes theorem, this inner product is well defined regardless of the choice 
of element in [c] or [cu]. For example, if some other cycle c" is equivalent to c and hence in 
[c], then c and c" differ by an exact form dd: 

c" = c + dc' (3.3.65) 
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So the new dot product will be 



c+dc' J c J dc' 



to + J du 

u (3.3.66) 



c 



where we used Stokes theorem to get the second line and the fact that to is closed to get 
the third. Therefore, any element of [c] will give the same integral as c. 
Also, if uj" is equivalent to uj, then 

uj" = u + du' (3.3.67) 

and 

' u" = [{u + du') 



u+ I u' 

dc 

UJ (3.3.68) 

where we used the fact that 9c = (because c G Zn{M) by definition). 

The above considerations show us that Hn{M.) is indeed the dual space to H^{M). 

3.3.8 The Euler Characteristic and Betti Numbers 

As we have seen, the cohomology group {H"-{Ai)) is isomorphic to the homology 
group {H„{Ai)), except Hn is over Z and is over M. This similarity allows us to define 
the Betti numbers equivalently as 

b''{M) = dim{H''{M)) = dim{Hn{M)) = h^{M) (3.3.69) 

And therefore (following ( |3.2.65 >) the Euler number is 



X 



(M) = Y.h'^yb'iM) (3.3.70) 



i=0 



This equation is one of the most remarkable in all of mathematics. The left hand side is 
a purely topological statement about A^. It relies entirely on global information about the 
qualitative shape of Ai, and is completely independent of any geometrical information. 
The right hand side, on the other hand, comes from purely analytic and geometrical state- 
ments about Ai. The basis of the cohomology groups (from which 6" is defined) is the set 
of equivalence classes of Z^{M), which is the set of solutions to the differential equations 

du = (3.3.71) 
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This interplay between topology and analytic geometry provides one of the deepest, 
richest, and most powerful avenues for mathematics. We will be diving much, much 
deeper into these types of relationships as we proceed though this series. We will see that 
such considerations actually form the backbone of much of string theory and fundamen- 
tal particle physics. 



3.4 Concluding Thoughts on Algebraic Topology 

We have considered homotopy, homology, and cohomologyj^ We saw that each of 
these provide a way of calculating the various types of "holes" that may appear in some 
space M. Homotopy simply told us what types of maps of can and cannot be con- 
tracted to a point, and in general formed non-Abelian groups. This notion of "what types 
of 5" can be contracted" was the whole point of homotopy. While this approach to clas- 
sifying spaces is extremely easy to visualize, actually calculating homotopy groups is 
extremely difficultp] 

Homology, like homotopy, also gives us information about the n-dimensional holes 
(places where 5*" cannot be contracted to a point), but there is a tradeoff. Homology 
groups are much easier to calculate, but they provide less structure in that they are merely 
free Abelian groups, unlike the generally non-Abelian homotopy groups. Also, the con- 
cept of what a homology group is is slightly more difficult to understand (the set of all 
things that could be boundaries not including the the things that actually are bound- 
aries). Fortunately the group structure we lost is not vital - often all we need to know is 
the number of holes - the added information in homotopy groups is not necessarily nec- 
essary. Also, the use of simplexes in homology brought us a little closer to geometry, and 
in that sense a little familiarity. 

Cohomology in many ways provides exactly the same information as homology. The 
only different in the information provided is that homology uses integers Z while co- 
homology uses the reals M. This difference isn't really substantial (especially for our 
purposes). What makes cohomology interesting is how it is calculated. Homology was 
based on the space itself - this made it easier to visualize (though admittedly more diffi- 
cult than homotopy), but less familiar to physicists. It wasn't difficult to see directly from 
the definition how the topological information about M provided produced topological 
information about Knowledge of the simplicial complex (which is essentially topo- 
logical information about Ai) gave us a way of calculating the number and types of holes. 
Topological information led to topological information. 

Cohomology, on the other hand, provided similar information as homology, but didn't 
require any topological information. While how this works wasn't obvious, (though we 



did provide an illustration starting on page 151 1, the calculations involve knowing nothing 



^''The order we discussed these in was very deliberate. We started with the conceptually easiest and 
went to the most conceptually difficult. And, we started with the "most topological" (read "least geometry 
involved) and went to the most geometrical. 

18 As we mentioned in the section on homotopy, there are a few very powerful methods of calculating 
homotopy groups, and we will discuss some of them later. However, even with such "machinery", such 
calculations remain notoriously difficult. 
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about the topology, but instead only analytic information about what types of forms make 
sense on A4. This provides greater familiarity to a physicist (who is accustomed to the 
vector space structure of forms), and provides insights into the deep interplay between 
geometry and physics. 

3.5 References and Further Reading 

The primary source for the section on homotopy was ||36| and ||38| , and the primary 
source for the sections on homology and cohomology were 13711 and [38]. For further 
reading in algebraic topology we recommend [7|], [|10||, [14], [21]], pl|], [|36|], [|41||. and [|46i]. 
For introductions to general topology we recommend ||32J , [,34J , and II43B . 
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Chapter 4 

Differential Geometry 



Now that we have given a brief introduction to some of the topological aspects of mani- 
folds, we return to our exposition of the geometrical considerations necessary for mathe- 
matical particle physics. 

The basic idea of this chapter is the introduction to a differentiable manifold of a struc- 
ture called a metric. In section 2.3.13 we discussed several of the questions that still re- 
mained after that initial chapter on manifolds. We currently have no way of telling the 
difference between a perfect sphere and an egg (and the topological considerations of the 
last chapter obviously don't help at all either). Things like distance between points, an- 
gles, and the relationship between vectors and covectors are also out of our reach thus 
far. 

The introduction of a metric on A4 will provide explanations/ answers to all of these 
things. Whereas before we can stretch, twist, bend, etc. ^A into whatever (homeomor- 
phic) shape we want, adding a metric essentially makes rigid. However it doesn't 
make it completely rigid. The metric limits us from being able to stretch Ai however we 
want to being able to only stretch Ai in directions perpendicular to itself. A good anal- 
ogy would be a piece of paper. If you lay the paper down on a table, you can't stretch it 
along the direction of the paper (it would tear). However, you can bend the paper into a 
cylinder, etc. This involves only bending that is not along the direction of the manifold. 
The paper is rigid, but not completely rigid. 

Another way of thinking about this is to take the same piece of paper and draw two 
intersecting lines. You can measure the angle between those lines easily. Then you can 
bend the paper however you want, and (assuming you don't tear it), the angle between 
those lines will be the same. 

These two equivalent ideas are the point of a metric. We are adding a structure that 
prevents us from deforming Ai along its own dimensions, and/ or we are only allow- 
ing deformations of Ai that do not change the angle between two arbitrary lines. The 
following sections will make this precise. 
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4.1 Metrics 



A Riemannian Metric gij on some differentiable manifold 7W is a type (0, 2) real sym- 
metric tensor field (no upper indices, two lower indices, cf page 19 ) defined at each point 
p E Ai. For p e Ui ^ M. with coordinate functions 0*^'^ = x, we write the metric as 

g(p) = gij{p)dx' (g) dx^ (4.1.1) 

where the rfx* are a coframe (basis covectors for the cotangent space) for A^Tp^A. The 
metric (4.1.1 ) will act on tensor products of the basis for Tp^A in the natural way: 

d 



gij{p)dx'^ ® dx^ 



More generally, for vectors 



dx°- 



_d_ 
dx^ 



9ab{p) 



(4.1.2) 



,(2),i 



d_ 

dx^ 
d_ 

dx^ 



(4.1.3) 



at p G f/j C M, we have 
^(p)(v 



{1) 



gij{p)dx^ ® dx^ 



ox°- ox° 



(4.1.4) 



We will drop the p from the notation for the metric unless doing so causes confusion. 
As stated above, we demand that g^j be symmetric: 



9ij 9ji 



(4.1.5) 



Mathematicians make the distinction between a true Riemannian metric and a 
pseudo-Riemannian metric by requiring that 



9 



(4.1.6) 



(where the equality only holds when either v = 0) for all vectors at all points for a metric 
to be "Riemannian". This condition is relaxed for a pseudo-Riemannian metric - we only 
require 



V v(^) G TpM 



v(^) = 



(4.1.7) 



Most physics is done on pseudo-Riemannian manifolds and we will therefore focus on 
them, though we will talk about both. 
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4.1.1 Metrics, Vectors, and Covectors 



We said on page 17 that there is no such thing as a dot product between two vectors. 
This was only somewhat true. If we have a manifold Ai with a metric g, then we can take 
any two vectors and map them to M. In this sense, we have an inner product between two 
vectors: 

^(v^'^v^-'')) G M (4.1.8) 

However, an equivalent (and better) way of thinking about this is that the metric pro- 
vides an isomorphism between TpA^ and A^Tp^A. To see this consider a vector v with 
components and a covectors u with components uj. The inner product between them 
(as defined in chapter 2), is given by 

(Uidx'') ( - — : I = UiV^dx""- — : 

= Uiv' (4.1.9) 

So the inner product between them is the sum of the products of the components. 

Now, for two vectors v*^^) and v^^^ (with components f and f respectively), the 
inner product using the metric will be (using ( 4.1.4[ >) 



^(vW,v(^')) = QyV^^^^'v^^^^^ (4.1.10) 

Now take the first part of the right hand side of this, gijV^^'''\ We can treat this as a single 
object with only a single lower index (the j index), because the i index is a summed 
dummy index: 

g^JV^'^^' = aj (4.1.11) 

But, we can recognize aj as the components of some covector in A^TpAi. Therefore we 
can express the isomorphism the metric produces between TpM and A^TpM: 

ooi = gijv' (4.1.12) 

where Ui is a 1-form in A^TpA^ and is a vector in TpAd. 

Also, we require that the metric be non-singular, and we denote its inverse as 

9'' = {9^,r' (4.1.13) 

and 

9'''9ij = 9ij9'^ = S'l 

g'^g,, = 61 = n (4.1.14) 
where n is the dimension of the manifold. We can use this to see that we can also write 

uJi = gijv^ =^ g^'uji = g'''g^jv^ =^ g'^'ut = (4.1.15) 
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Summarizing, we have 



(4.1.16) 



In other words, the metric is used to "raise" or "lower" the indices of a vector or a covector, 
turning one into the other, etc. 

Generalizing, for a general tensor with multiple indices, we can apply the metric to 
each index separately: 



.abc 



giagjbgkcv 

„ia, „jb ^kc, , 

g g g ^atc 



(4.1.17) 



and so on. 



4.1.2 Simple Examples of Metrics 



So what does a metric do? We will first consider this question in terms of a few simple 
examples. 

The simplest example of a metric is the Euclidian Metric, where 



gij = g 



/I 
1 



\0 







1/ 



Then using (4.1.41 the inner product between two vectors v^^^ and v*^^) is 



(4.1.18) 



(4.1.19) 



Or, we can find the covector corresponding to v^^^: 

So the components are simply equal, and we therefore write the form 



v{i),i = gijV^^^'^ 



instead of cjj. So the inner product between v^^^ and v^^^^ in (4.1.19 1 is actually 



(4.1.20) 



(4.1.21) 



(4.1.22) 



Once again this is actually an inner product between a vector and a covector (the metric 
turns vectors into covectors). In this way our "upper and lower indices are summed" 
convention holds. 
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Another example is called the Minkowski Metric, 



9ij = 9'^ = Vij 



/-I ■■■ 0\ 
1 ■■■ 



Vo ; 1/ 



(4.1.23) 



Now we 



(4.1.24) 



i=l 



Furthermore, when we raise or lower indices, we find that 

Villi = v^^^'' i^Q 
^(i),o = 



(4.1.25) 



(and the same for v*^^^). Therefore the "upper and lower index" form of the inner product, 

^(v(^), v(^)) = = + v^'^'\^''>'' (4.1.26) 



i=l 



still holds. 

We can also write down a more general metric with more than one negative compo- 
nent. Consider a metric with n — I's and m +l's: 



9ij = 9 ■ 



Now the dot product will be 



/ -1 
-1 




\ 



\ 




1 
1 / 

m+rt— 1 



(4.1.27) 



(4.1.28) 



i=0 



Such a metric is said to have index (n, m). For the special case of = 1 we call g a Lorentz 
Metric, or a Minkowski Metric. 

Finally, let's look one more time at the Euclidian metric (4.1.18). Because gij = Sij is 
simply a tensor, we can do a normal transformation to write it in any other coordinate 
system. For example in we have (in Cartesian coordinates) 



9ij = 



1 
1 



(4.1.29) 



^We are taking the first component to have index instead of 1 for later convenience. 
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If we wanted to transform this to, say, polar coordinates, we can use the standard trans- 
formation law for a rank 2 covariant tensor ( 2.3.52[ > (using a tilde to represent the polar 
coordinates) 



So, using the transformations 



dx'^ dx^ ^ 



X = r cos 9 

y = rsinO (4.1.31) 



we have 



ox^ ox^ 



dx^ dx 
dx^ dx 

dx^ dx^ 
dx dx dy dy 

dr dr dr dr 

cos^ e + sin^ e 



kl 



And similarly 



dx'^ dx'' 



So, in polar coordinates we have 



(4.1.32) 



dx dx dy dy 

dem ^ mm 

2 ■ 2 Q , 2 2 Q 

r sm tl + r cos a 

(4.1.33) 



~9^^ = {l °) (4.1.34) 



Generalizing to M'^, you can start with the three dimensional Euclidian metric on in 
Cartesian coordinates. 




9^J= [0 1 (4.1.35) 
\0 ij 

and transform to show that in spherical coordinates this metric is 

/I 0\ 
gij = r^sin^^ (4.1.36) 
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4.1.3 The Metric and the Pythagorean Theorem 

Consider a Euclidian manifold with metric gij = dij. Then consider some vector 

V ^ x^ei + x^es + x^Bs (4.1.37) 
We can use the metric to lower the index to find the corresponding co vector w: 

Xi — Qij-^'^ — ^ij-^"^ 

=^ Xi — X^ 
^ X2 — x'^ 



(4.1.38) 



So, the inner product between the vector v and its corresponding covector is 

g,(v,v) = W(v) = XiX' = X^X^ +X^X^ +X^X^ (4.1.39) 

We recognize the right hand side of this expression as the length squared of the vector v. 
If we had chosen a different metric, say 




9ij= 10 (4.1.40) 



then we would have a different length for the vector: 

g{yf^Y)^ 2x^x^ ^x^x^ ^x^x^ (4.1.41) 

So these two metrics correspond to two different "types" of M^. The first is "normal" 
three-dimensional Euclidian space, while the second has been contracted along one axis. 
However, this can actually be thought of as merely a coordinate transformation where we 
relabel the x-coordinates. 

For this reason we can also think of a metric as relating to distance. Specifically, con- 
sider moving an infinitesimal distance (generalizing to an arbitrary three-dimensional 
manifold) 

dx^ + dy^ + dz-^ (4.1.42) 

ox ay oz 

Obviously the infinitesimal displacement will be 

{dxf + {dyf + {dzf (4.1.43) 
If we want to know the distance between two points a and b along some path on M, we 
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simply do the integra^^ 

/ ^{dxf + {dyf + {dzf (4.1.44) 

J a 

along the path from a to h. 

More generally, for some infinitesimal displacement 

dx'-^ (4.1.45) 
dx' 

we can find the infinitesimal distance, denoted ds^, as 

ds^ = g\dx^-^,dx^ ^ 



dx^ ' dx^ 

dx'dx^ g( 7^ 

\ ox^ ox^ ^ 

Qijdx'dx^ (4.1.46) 



which is essentially the definition of the metric ( 4.1. 1[ >. For this reason we will speak of 



the "metric" and the "infinitesimal displacement squared" interchangeably. 
4.1.4 Induced Metrics 

Finding a metric for a space is extremely important for most physical applications. 
However doing so may not always be easy. One very powerful tool for finding the metric 
of a given space is to find an induced metric. 

Writing a metric for, say is easy - gij = 5ij. But what is the metric for a circle? This 
may not be so obvious. However, it is possible to write the circle as a subspace of M^: 

X = r cos 6 

y = rsme (4.1.47) 



(for constant r). We can think of (4.1.471 as a map|^F from into M^: 



F : — 

F\e) = rcose 

F\e) = r sine (4.1.48) 



^Generally we rewrite this as 

nb nb 

/ ^/{dx)^ + {dyY + {dzY = / dx^J\ + ,i/'2 + z'2 

J a J a 



where the prime denotes a derivative with respect to x. This expression is the one generally learned in an 
introductory Calculus course for distance. 



^It would be helpful to reread section 2.3.6 at this point 
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Therefore, any tensor product of forms on can be mapped back to by using the 
puUback /*. And because a metric (on M^) is a tensor product of forms, we can find its 
puUback on 5^: 

9s^ = rigu'^) (4.1.49) 



So applying (2.3.641 to g = 6ijdx^ dx^, 



f{6ijdx' (g) dx^) = 6ijf*{dx' ® dx^) 

OF' dF^ 

~de~de 

'dF^V fdF^^^ 



^ij^^^^dO (g) dO 



09 J \de 

= {{-rsinef + {r COS e)'^)d9®d9 

= r'^de®de 

= r\def (4.1.50) 

This is exactly what we would have expected - this is saying that the distance you've 
moved if you walk some displacement around a circle is proportional to the radius and 
the angle. In other words 

ds^ = r\dey =^ ds = rdd (4.1.51) 

We can integrate this to get 

s = re (4.1.52) 

which is a standard result in introductory math and physics. The most familiar case of 
(4.1.52) is that if you walk around a full circumference C, or ^ = 2n radians, then ( 4.1.52[ >) 
gives 

C = 2Txr (4.1.53) 

The standard expression for the circumference of a circle. 

On the other hand, we could map into in a different way. Consider 

X = 2r cos 6 

y = rsm9 (4.1.54) 

(again for constant r). This creates an ellipse with radius 2 along the x axis and radius 1 
along the y axis. 
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Now we will have 

= r^(4cos^^ + sin2^)d^2 (4.1.55) 

So for the circle we have 

dslrcie = r-'de^ (4.1.56) 

and for the ellipse we have 

dslui,se = r\A cos' 9 + sin' 9)d9' (4.1.57) 

We can graph each of these as a polar plot. For the circle, we graph R{r, 9) = r', which 
gives 
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And for the ellipse we can graph R{r, 9) = r^(4 cos^ 9 + sin^ 9), which gives 



Recall from the previous section that you can think of the metric as defining an infinites- 
imal distance. In other words if you start from a point on and move a small dis- 
placement d9, the distance (squared) you travel is given by ds'^ = gijdx'^dx^ , or in our case 
ds^ = geed9'^. For the circle this is simply constant - no matter where on the circle you are, 
a small displacement in 9 takes you the same distance. On the ellipse, however, a small 
displacement does not have the same result at any two points. For example, if you are on 
the point on the positive x axis {9 = 0), you are very far from the origin and therefore a 
small change in 9 will move you farther than if you were close, for example on the posi- 
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tive y axis {9 — Tr/2). Therefore the value of the metric is the greatest at the points farther 
from the origin, and the least at the points closest. 

Notice that these two maps are both from S^. They are completely homeomorphic to 
each other. The only difference is that one is stretched out more than the other when drawn 
in R^. One difficulty with our approach is that we are only talking about metrics that are 
induced on a submanifold by the manifold it is embedded in. With that said, should we 
really expect a difference between (iScircie ^^1*^ '^■^liupse'^ They are both one-dimensional 
manifolds, and moving along that one dimension a certain distance is simply that - mov- 
ing that certain distance. The reason we found different metrics is that we embedded in 

in different ways, and as a result the value d9 had a different meaning in the two cases. 
However, some given distance on the circle isn't really different than the same given dis- 
tance on the ellipse. We therefore suspect that there is no substantive difference between 
the two, and the different metrics are merely due to different coordinates (in other words, 
different 9's). Sure enough, it is straightforward to show that if we start with the circle 
metric 



gee = (4.1.58) 

We then take the coordinates on the ellipse to be 9'. We can do a coordinate transformation 
to get the metric on the ellipse from the metric on the circle: 



9e'e' an, ao,^ 



d9_d9_ 
89' 89' 



89 \ , 



. r 

36' 



= r^{Acos^9' + siT?9') (4.1.59) 
which would come from the coordinate transformation 



j d9'\/l 



cos^9' + sm^9' (4.1.60) 



Evaluating this in closed form is difficult because it is an elliptic integral, but it can be 
done and the result would simply transform gee into ge'e'- So we have shown that the 
metric dslmp^^ and dsl^^^^ are actually the same metric, differing only in the coordinates 
being used. 

In general, if two manifolds (appear to) have different metrics, but there exists some 
coordinate transformation that takes one metric to the other, then the metrics are actually 
equivalent. On the other hand, if no such transformation exists then the metrics are truly 
different. In general such an approach is a highly nontrivial (and essentially impossible) 
problem and other techniques must be used. Such ideas will be discussed later. 

As another example, consider a map from 5"^ (with unit radius) into R^ given by 

X = sin 9 cos 
y = sin 9 sin 

z = cos^ (4.1.61) 
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We leave the details to you, but the pull-back of the Euclidian metric on M.^ will give 

ds^ = de^ + sin^ ed(j)^ (4.1.62) 



On the other hand, we could choose another map homeomorphic to S'^ but with a 
different metric. Consider 



X = sin 6 cos < 
y = sin 6 sin <; 
z = X cos 9 



(4.1.63) 



for some arbitrary constant A. We again leave the details to you and merely quote the 
result. 



ds^ = - ((A' + 1) - (A' - 1) cos{2e))de^ + sin^ ^-^^^ 



If we choose A to be greater than 1 we get an egg: 




On the other hand if we choose A to be smaller than 1 we get a pancake: 




(4.1.64) 
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As a illustrative exercise, consider the A << 1 regime, which gives a very flat pancake. 
Here 

ds^ = -(1 + COs(2^))rf^2 ^ gi^2 Q^^2 (4.1.65) 

In the 9 «1 region of the manifold (in the center of the flat part), we have 

ds^ = de^ (4.1.66) 

or simply 

ds = dO (4.1.67) 

Here a change d9 corresponds to moving radially outward from the center of the flat part 
of the pancake, and this metric tells us that indeed the lines are flat. Furthermore, in this 
limit, there is no real meaning to a change in 0, as indicated by the metric. Notice that 
ds = dO is the same metric we would have for a circle of radius 1. In fact, in the 6' = 7r/2 
regime we have 

ds^ = d(j)^ (4.1.68) 

The difference in the two relates to the global topology of the space we are considering. 
The metric for W- is ds^ = dx^, and the metric for a unit radius is ds^ = dO"^. But while 
they have the same metric, they have different topologies. This illustrates an important 
point that we will discuss later - the metric is a local structure. It gives us information, 
point by point, about infinitesimal distances. The metric is not, however, a global struc- 
ture, and it therefore doesn't tell us anything about the topology of the manifold. We will 
consider physical ramifications of this later. 

For now, however, we are merely trying to show what kind of information the metric 
can express by considering various limits of things homeomorphic to S"^. 

As a final example consider the map from the torus into given by 

X — {R + r cos 9) cos 
y — {R + r cos 9) sin (p 

z = rsin^ (4.1.69) 

where R and r < R are the two radii of the torus. We again leave the details to you to 
show that the Euclidian metric on will pullback to induce 

ds"^ = r^d9'^ + {R + r cos 9)'^d(i)'^ (4.1.70) 

For any manifold J\f that you can write as a submanifold of another A4, you can easily 
find the metric on Af that is induced by the metric on M by simply using the pullback. 
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4.2 Connections and Covariant Derivatives 



4.2.1 Parallel Transport 

In this section we will continue with our discussion of metrics, though it will not 
be immediately obvious that this is what we are doing. Our goal will be to build up a 



more general calculus on manifolds. But, recall from section 2.3.9 that we had a problem 



when we wanted to find the derivative of some vector field (cf the discussion on page 



74 1 over a manifold. The problem was that derivatives involve comparing vectors at two 
different points, say p and q. However the value of a vector field at p and the value at q are 
(obviously) in two different tangent spaces. And because there is no natural way to relate 
tangent spaces to each other, this comparison (between the vector at p and the vector at q) 
is not well defined. 

One consequence of this is the transformation properties of a derivative. In general, a 
vector is described by a set of numbers in some basis (i.e. (x, y, z), (r, 6*, 0), (p, 0, z), etc.). 
However this vector is independent of the coordinate system used. This is why vectors 
have specific transformation rules under a coordinate transformation - the description re- 
tains its identity even though its description changes. Such a transformation is called a 
covariant transformation. We have considered such transformations extensively in pre- 
vious sections. 



Recall from the discussion following (2.2.251 that any (generally nonlinear) change 



of coordinates induces a linear change of components at each point. The basic idea is 
that we want things that transform in a tensorial way under such transformations (cf 



page 25 1. A derivative, however, is by definition evaluated at two different points in the 
manifold (taking the difference between the tensor at two points and dividing by their 
distance). For this reason the standard definition of a derivative will not transform in the 
nice linear tensorial way under a coordinate transformation. To see this, consider a vector 

dfoldAI.W 

dv'^ ( d 



field V = w't^ defined over a manifold M.. We can take the derivative of v 

ox'- 



{v') (4.2.1) 



If we want to apply a transformation to this, the derivative part will transform according 
to ( 2.3.9[ >, and the vector part will transform according to (2.3.13 1. So (4.2.1 1 will transform 



according as 



dx^ / ^ V dx'^ 



-V 



dx'^ dx'' J \ dx' 
dx'' dx'^ dv' dx'' d'^x'' 



dx'^ dx' dx'' ~^ dx'i dx''dx' ^ ^ 

Notice that the first term in the last line would be the expected transformation law for an 
object with a single covariant and a single contravariant tensor (i.e. |^). However the 
second term means that this is not a tensorial object. 
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The reason, as we have said, is that a derivative involves evaluating two different 
points, and because of the general non-linearity of a coordinate transformation, the com- 
ponent transformation for a derivative that is induced is not linear. 

To illustrate this imagine a vector field w(q(e, x(j)))) defined along the curve q(e, x(j>)). 
We can assume polar coordinates and take the partial derivative of, say w"^ with respect 
iod) 



dw'^ ( d 



[w^ (4.2.3) 



If we take 



= X x"^ 



= r = (4.2.4) 

and the coordinates of w in Cartesian coordinates to be denoted = and v'^ = , then 



the transformation of this from polar back into Cartesian is (following (4.2.2 1) 



dx^ d \ f dy^ 



dy'^ dx'^ J \ dx'- 



dx^ dy^ dv^ dx'' d'^y^ , „ _v 

= - I - I,'- (4.2.5) 

dy'^ dx' dx'^ dy'^ dx'^dx' 

Again, the first part in the last line is what we would expect for a tensorial transformation. 
Using the standard 

x^ = X = r cos (j) = y^ cos y'^ 

x^ = y = r sincf) = y^ siny'^ 

y^ = r = ^x^ + 1/2 = ^{x^y + (x2)2 

.2 - /'^^ .^„~ify'^ 



= <P = tan-^ ( = tan'^ \^ ) (4.2.6) 



we can write the second non- tensorial term in ( 4.2.5| > as 



dx'^ d'^y^ , ( dx d'^r dy d'^r \ , 

V = I 1 ]v 

dy"^ dx^dx' \d(j)dxdx d(j)dxdyj 



+ 



/ dx d^r dy d'^r \ 
\d(f) dxdy d(j) dydy ) 



,2 



= — sin0f ^ + cos0f^ (4.2.7) 

So, as expected, the partial derivative ^ doesn't transform as a tensor, due to the inho- 
mogeneous term ( 4.2. 7[ >. in other words 



dw"^ dw^ dx^ dy^ dv ^. ^ i ,2 



Qy2 Qy2 Q^l Qrj.k 



sin (f)v + cos (j)v (4.2.8) 



178 



The additional two terms resulted from the fact that the derivative involved comparing 
points in two different tangent spaces. It is for this reason that to take a derivative we 
must find a way of evaluating both parts in the same tangent space (in other words, at 



the same point), so as to avoid this problem. In section 2.3.9 we solved this for the Lie 
derivative by taking the derivative of one vector field (v^^^) with respect to another vector 
field (v^^)). Specifically, we used the curve induced by v^^^ to build the tangent mapping 
to map v*^^) at p to its value at q (where the curve induced by v*^^^ goes from p to q). 
Doing this allowed us to consider both tensors at the same point, and therefore we had a 
derivative that would transform in "nice" tensorial way. 

And while this derivative is well-defined and extremely useful for a variety of things, 
it is not the most general derivative we can define. Therefore let us rethink how we can 
find derivatives on a manifold to solve this problem. The fundamental problem is that 
we must be able to compare a vector in TpM. to a vector in TqM. where p q. This 
requires some way of "transporting" a vector from TpM. to TqM.. While the Lie derivative 
provided one way of doing this, it is not the most general. 

Consider a manifold M. and a curve through the point p, denoted q(r, x(j>)). The 
vector v(q(r, x(j>))) corresponding to this curve at p is 



(iq(r,x(j))) 
dr 



v(q(r,x(2)))) 



(4.2.9) 



(cf equation (2.3.105 1). Now consider some other (arbitrary) vector u(q(r, x(j>))) at p, that 
is not necessarily part of a vector field - it is just a vector in TpM. We want a way of 
transporting u along q (in the direction of v) without changing it. 




But what do we mean by "not changing it"? This notion is not entirely precise and 
we must therefore specify some set of rules by which we move u around. As a simple 
example, consider an arbitrary vector with components u = « 



+ u^By in '. 



32. 
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Let's take the curve q to be the line parallel to the x axis through the point (x, y) so that 
V = e^;. If we drag u along q to some other point: 



X 


/ 


X 


/ 


X 

















notice that the components haven't changed: u = u^e^ + u'^ey. Similarly, had we taken q 
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to be parallel to the y axis we would have gotten the same result. So, moving a vector u 
around with the standard Cartesian basis is trivial - no matter where you move u the 
coordinates are the same: 



uy — > yy (4.2.10) 
Now consider the exact same vector, but let's use polar coordinates: 




Notice that we can break w = u'^Gr + u'^e.^ up into components according to 

li^ = |u|cos0 
ru^ = \u\sm9 (4.2.11) 

Now let's take q to be along the r direction: 
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(r+Ar,o) 



Taking r — > r + (5r has no affect on m^, and therefore under this transformation 



u 



However, 



lul sin 6 



u 



|u| sin^ 

r + 5r 
1 |u| sin^ 

r i + 7^ 

r \ r 
lul sin 6* 



r 



6r 



u I sin 9 

6r |u| sin 6' 



(4.2.12) 



(4.2.13) 



So, when an arbitrary vector is moved along the r direction the components don't stay 
the same. An additional term is needed to "correct" for the changing coordinate system. 
Similarly, if we take q to be along the (p direction: 
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we have 



T 

u 



u| cos 6 
lul sin^^ 



u| cos(^^ — 50) ^ u"^ + u'^r6(f) 



u 



u| sm(6' - 50) . ^50 




(4.2.14) 



r 



So when Cartesian coordinates are used, transporting a vector around is trivial - 
the coordinates don't change. But when polar coordinates are used, we must include a 
term to correct for the changing coordinate system. 

So what is the use of what we have done? We have written out a way to move an 
arbitrary vector around without changing it. We denote this way of dragging a vector 
around parallel transporting the vector. Because parallel transportation is a way of mov- 
ing a vector u along a path in the direction v without changing it, there must exist some 
type of derivative, denoted Vv, such that 



VvU = 



(4.2.15) 



We call Vv the covariant derivative in the direction of v. 
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Now again consider some vector field w(q(r, x(]:>))) defined on q C 



)2. 



Focusing on the non-trivial polar coordinate case, equations (4.2.121, (4.2.131, and 



(4.2.14) tell us how an arbitrary vector can be dragged along q without changing it. This 
allows us to construct a derivative in a well-defined way. We first Taylor expand to the 
new point, then we parallel transport - the difference defines the derivative. 

For example, if we want to find the derivative of w at point q^{0,x(p)) E M^, first move 
some small distance e away from q(0, x(p)) and evaluate 



w(q(e,x(j)))) = w(x(2)) + (5x) 
= w(x(]9)) + 6x^ 



9w(x(j))) 



(4.2.16) 



Or 



tt;^(x(j)) + 5x) 



w'^(x(]5)) + 6r 



dr 
dr 



dw'^{-x{p)) 



(4.2.17) 



Then, using (4.2.121, (4.2.131, and (4.2.141, parallel transport w(q(0, x(j>)) from x(j>) to 
x{p) + 5x. For example, if q is along the 9 direction. 



w^(x(j>) + 5x) 
w'^(x(]9) + 5x) 



w^{'k{p)) + w'^{'K{p))r6(f) 
w'^{^{p))-w'-{^{p))^ 



(4.2.18) 
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Now we can build the covariant derivative of w by taking the difference between (4.2.171 
and ( |4.2.18| >. 

So let's say we want to take the covariant derivative in the (p direction. This will b^ 



lim 
lim 



^ dw"' ^ , dw"' „ ^ ^ , 
w + or— h ocp^— w — w^rocp 



dr 



6r— h 



dr 



— w^r 



lim 

Qyj<t> 



+ 6r— h 

or 



+ 



w 



(4.2.19) 



We can calculate V^w similarly: 



Summarizing, 



dr 

dr r 



(4.2.20) 



dr 



— W^r Gr + 



dd) r 



dr r 



(4.2.21) 



Now consider V,^^'' only (just as we considered only in (4.2.81). We know from 
(4.2.8) how U transforms. We can also calculate how the second part will transform: 



rw^ 



r^v' 
dx^ 

d(f) , d(f) 2 
r—v +r—v 
dx dy 

1 , 



rx 



x^ + 



x^ + 

,2 



— sin 0f + cos 0f ^ (4.2.22) 
Notice that this is exactly what we found to be the "problematic" term in ( 4.2.8[ >. This 



^Suppressing the x(p) argument for notational simplicity. 
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means that the transformation of V^w^ will be 



dw'" 



— w^r 



Q^k Qyl Q^l 

dy'^ dx^ dx^ 
dx^ dy^ dv'' 
dy"^ dx' dx'' 

And from ( 4.2.10| > we can see that 



sin (bv^ + cos (hv'^ + sin Sv'^ — cos (bv^ 



(4.2.23) 



dv' 

dx'^ 



and therefore 



dx'^ dy^ ^ I 
dy"^ dx' 



(4.2.24) 



(4.2.25) 



So by constructing the covariant derivative we have created a derivative that transforms 
in a nice tensorial covariant way. That is the meaning of the additional terms added to 
the derivative - they preserve the covariance of the derivative under arbitrary coordinate 
transformations by canceling the non-linear terms created when the partial derivative and 
the connection transform. 

To summarize, what we have done in this section is a very specific and very simple 
exposition of creating a derivative that transforms covariantly. The fundamental problem 
with defining a derivative is that the definition of a derivative involves comparing tensors 
in different tangent spaces, and there is no way natural way of doing this. 



A fundamental symptom of this problem is the additional non- tensorial term in (4.2.2 1, 
which tells us that in general the derivative of a tensor doesn't transform in a "nice" ten- 
sorial way under general coordinate transformations. We then took a very simple vector 
field w in polar coordinates and looked at how the frame changes under translations (cf . 
equations ( |4.2.12 >, (4.2.13), and (4.2.141). By using these terms we were able to construct 
a well-defined parallel transported vector from one tangent space to another, and using 
this we constructed a derivative that took into consideration the changes in the coordi- 



nates (cf equation (4.2.21 )). We called this derivative the covariant derivative because we 



claimed that it would transform covariantly. 

We then took one of the components of the covariant derivative and, knowing the 
transformation laws of each part (^|^ and w'^r), wrote out the transformation law for 

V ^w'^ , and found that the additional term {—w'^r) transformed in a way that exactly can- 



celled the additional non-tensorial term in (4.2.21, proving that the covariant derivative 
does in fact transform covariantly as the name suggests. 

We want to generalize this approach to constructing covariant derivatives, but before 



doing so we make a few comments. As we said, (4.2.2) suggests that the problem with 



a normal derivative is that you get an additional non-linear term when you transform a 



derivative. The form of each term in (4.2.21 1 reveals what we will find to be a standard 
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pattern - the covariant derivative is the normal partial derivative plus some other term. 
And as equations ( 4.2.22[ > and (4.2.231 indicate, the additional term will transform in a 
way that exactly cancels the additional non-linear term from the partial derivative. 

We will discuss all this in much greater detail and generality in later sections, but 
for now we comment that because the additional terms that are added onto the partial 
derivative to form the covariant derivative provide a way of comparing tensors in dif- 
ferent tangent spaces in a well-defined way, or in other words of "connecting" tangent 
spaces, we call the additional terms the connection terms. 

4.2.2 Connections and The Covariant Derivative 

We now continue with a more quantitative explanation of what we did in the previous 
section. We will, in essence, repeat much of what we did previously, but more abstractly 
and with more generality. 

As we said above, our goal is to construct a derivative via some way of comparing 
vectors in different tangent spaces in a well-defined way. This will entail finding a way of 
taking a vector at p in TpM and mapping it to a vector at p' in TpiM. We can specify this 
new location as we did before via some curve p = q{T, p). Therefore, instead of ( |2.3.43 >: 



■ dq^ d 
dx^ dx^ 



we will take a more general approach. The basic content of (2.3.43 1 is 



where 



i d 
r — 
dx' 

_d_ _ 



dq^ d 
dx^ dx^ 



(4.2.26) 



(4.2.27) 



(4.2.28) 



In other words, this was a prescription for changing the frame when moving from one 
point to another (and therefore from one tangent space to another). The use of the tangent 
mapping induced by g(r, p) provided a way of "connecting" two tangent spaces. 

To generalize this, consider a vector field v(j)). We want to compare the value at point 
p and at point q{e,p). Or in coordinates, at point q(x(]9)) and q(e,x(]9)). But we can 



naturally rewrite q(e, x(j))) as we did in (4.2.16) 



q(e,x(i))) = x(2)) + 5x 



for some appropriate 5x (cf (2.3.114)). In components this will be 

g*(e,x(2))) = x\p) + 6x' 

Now we can write the two values of our vector field v(j>) as 

v{p) = v(x(2))) 
v(g(e,l))) = v(q(e,x(j)))) = v(x(j)) + (5x) 



(4.2.29) 



(4.2.30) 



(4.2.31) 
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The derivative in the direction will now be 



[v(x(g(e, p))) - v(x(2)) + 5x 

5x^0 OX*^ 



(4.2.32) 



Just as in section 2.3.9 we have the problem that this is a valid form for a derivative in 
principle but there is no natural way of comparing the tangent space at p to the tangent 
space at g(r, p). To fix the problem in section 2.3.9 we used the tangent mapping induced 
by the curve g(r, p) to write v(x(]5) + (5x) as a vector in TpM (as indicated by (2.3.43 1). And 
as we pointed out above, ( 2.3.43[ > is simply a way of "connecting" vectors in two tangent 
spaces. 

Now we can generalize. Instead of using the tangent mapping induced by g(r, p), let's 
say that 



dx^ 



(4.2.33) 



where the new frame is still some linear combination of the old frame. Specifically, let's 
say 



v\^{p) + 5x) = v\^{p)) - q^^dx^ v\-k{p)) 



(4.2.34) 



where the F*;. are some particular set of functions (which may depend on p). The minus 
sign is for later convenience. The content of this is that if we take the vector v(x(j)) + 5x), 
which is in the tangent space T^(e^p)A^, and map it to the tangent space TpM, the new 
components are the old components plus a term proportional to the old components and 
the displacement. Because F*^ can be any set of constants, this is a general expression. 

On the other hand, we can use the standard Taylor expansion for the second term in 
(I4.2.32D: 



V X 



{e,p))) 



t;^(q(e,x(2)))) 
f*(x(p) + 5x) 



f*(x(p)) + 6x^ 



■ dv\^{p)) 



dxi 



(4.2.35) 



Plugging (4.2.341 and (4.2.351 into (4.2.321 (in component form), 

[v(x(g(e, p))) - v(x(p) + 5x 



lim —r 

<5x-»0 OX 



dx^ 



lim -— fe-' 

<5x^0 ox'' 



— + V- v'- 
dxi ^" 



(4.2.36) 



The only term that will survive the 5x ^ limit is the j = k term, so finally the k^'^ 
derivative is 



dx^ 



(4.2.37) 
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Notice that this has the form we expect from the previous section - the partial derivative 
plus an extra term. This is the general form of the covariant derivative, and the terms 
Tl^ are the connection terms because they provides a particular way of connecting two 
tangent spaces. 

We denote the covariant derivative of ^ in the direction as 
More generally we can find the derivative of a vector u = ^ in the direction of a vector 



Next we find the covariant derivative of a form uj = (jjidx\ We know that Vv is a 
derivative and we therefore expect that it obeys the Leibnitz rule. So, taking the inner 

i d 



product of (jj with a vector v = v^-^, we have 



11-^ (jj, -I- 11-^1^ f.^-lH -L f.^-lH I .. 1} 



The left hand side of the second line comes from the fact that the covariant derivative of 
a scalar (o^jW* is a scalar) is simply the derivative - there are no indices to contract with the 
connection and therefore the cormection does not have an effect on a scalar). Then, we 
can take — 1 for one value ofi — n and for the rest, leaving 

{W^u;n)^u^(^-u;r,n) (4.2.40) 



Or more formally 



V^uJidx' = u^^- ^fcTj, ) dx' (4.2.41) 



We can easily extrapolate this definition to an arbitrary tenor of rank (m, n) as follows: 

... Q . . . 

^ ,rpJl,32,— ,Jm _ _Z_rpJl,j2, — ,Jm 

' ki,k2,—,k„ Q^i k\,k2,—,kn 

'^^ ip-'- ki,k2:...,kn ' ip ki,k2,...,kn ' ' ' ' ' ^ ip ''^ ki,k2,...,kn 



iki p,k2,--.,k„ ik2 ki,p,k3,...,k„ ' ' ' ikn ki,k2,---,p 



(4.2.42) 



189 



Finally, notice from (4.2.381 that we can find the covariant derivative of an individual 
basis vector 



d6l_ 



kri 



d_ 



^ ki^J 



(4.2.43) 



So as we move around in Ai the basis vectors change. The way a given basis vector 
Sj G TpM is changing at p G is again a vector (the derivative). And because the 
derivative is a vector in TpA^, we can write it in terms of the basis of Tp Ai, which is 
simply a linear combination of the basis vectors of TpAd. So we can think of F*^ as the i 



th 



component of the derivative of e/j with respect to xK We can actually think of (4.2.431 as 
the definition of the connection coefficients. 

For clarity, the example we did in the previous section with the vector field w in polar 



coordinates gave us (cf. equation (4.2.341) 



^4 

r 



■ —r 



(4.2.44) 



and in Cartesian coordinates the connection was trivial: 



4.2.3 Transformation of the Connection 



(4.2.45) 



In our primary example in section 4.2.1 the connection term for V<^w^ was 



-w'^r. We 

saw in ( |4.2.23[ > that under the transformation from polar to Cartesian coordinates this 
term created a "counter-term" that exactly cancelled the n on-linear /non-tensorial term 
created when ^ 



transformed. We claimed in section 4.2.1 that this is indicative of a gen- 
eral pattern - that the transformation law of the connection cancels out the non-tensorial 
transformation of the partial derivative. Now we prove this. 
We saw in ( 4.2.2[ > that, in general 



Q^k Qy'L Q^l ^ Q^k 



Q2yt 



dy^ dy^ dx^ dx'^ dy^ dx^dx^ 



(4.2.46) 



where w is the vector in coordinates ?/* and v is the vector in coordinates x*. Let's denote 
the connection for the x coordinate system V^-j., and the connection for the y coordinate 

system F*^. If we then denote the basis vectors 



_d_ 
dx^ 



and 



d_ 



(4.2.47) 



we can write (cf. ( |4.2.43| >) 



n^k 



(4.2.48) 
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However, using (2.3.91 we can write 



d dy^ d dy^ 



dx^ dx^ dy^ 



dx^ 



(4.2.49) 



Plugging this into both sides of (4.2.48 1 we get 



Ve.e,- 



V, 



dy^ 



a/ 



a; 



dyP 



a; 



g2yi 



dx^dx^ 
dx'^dx^ 



dx^ 
QyP dy'^ 

dyP dy^ ~i 
az + ——Tkpi^i = 



r. 



dy^ 
dy^ 



a; 



iiQ^k 



a; 



,1 



+ 



dx^ dx^ 
dyP dy''_~i 



kp 



dx^dx^ dx^ dx 
d'^y^ dx'^ ^ dyP dy'^ dx'^ 



'^dx^ 



I 



dx^dx^ dy^ dx^ dx^ dy 



I kp 



. dy^ dx'i 

ijg^k Qyl 



T^Jl (4.2.50) 



ij k 



Or 



QyP Qyk Q^q 



dx^ dx^ dy^ dx^dxi dy^ 



(4.2.51) 

This is the transformation law for a connection coefficient - notice that the first term on 
the right hand side is what we would expect if F* ^, did transform like a tensor. The second 
term on the right is the non-linear term that prevents it from being tensorial. 

So, starting with a general covariant derivative (4.2.391, we can transform it using 
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( |4.2.51| ) and ( |4.2.46| )f| 

' dw 



U-' 



(9|/* dx^ dw" ^ dx'^ 



dx'^ Kdx"" dy^ dxf^ dy^ dx°'dxl^ 



+ 



dx" dx^ dy^ 



dy"^ d'^x'^ 



dyk dy^ ' dx'y dyWy^ 

Qyj Qy^ QxP dx'' dw°' d 
dx^ dx°' dy^ dy^ dxf^ dx'' 



dx^ 



dx" d 
dy^ dx'' 



, dy^ dx" dx'^ d'^y' 



d 



dx'^ dy^ dy^ dx^dx/^ dx 
dy^ dx" dx^ dy^ dy^ dx'' 
dx't' dy'' dy^ dx^ dx^ dy' 
^ dy^ dy^ dy^ dx" d'^x'^ , 



7 5 A. 

d 



u 



+u 



dx^ dx'^ dx^ dy'^ dyWy^ dx'^ 
dw" d J, dx" d'^y^ 



^ " dxi^ dx" 



w 



u 



dw" d 



Jdw" 



dx" 
dx" 



^ dy' dx"dxP 
+ u 



d_ 
dx" 



a„p7 ,,5 9 ^dy^ dy^ d^x^ 



d 



w 



Qrj.<f> ^dx^dy^dy'^ dx 



d 



dy'- dx^dxf^ dx 



w 



/3 



Pa 



W 



_d_ 
dx" 



Qyj Qyk ^2^7 ^ Q 



w 



dx't" dx^ dyidy'' dx'^ 



\ dx^ 

VuW + u 



d 
dx" 



dx^ d'^y'^ dy"' dy" d'^x^ 



+ 



dy"' dx"dx^ dx" dxi^ dy"'dy" 



d 

dx^ 



dx^ d^y"' dy" dy"' d^x^ 



+ 



dy"' dx^'dxP dxf^ dx" dy"'dy" 



_d_ 
dx^ 



(4.2.53) 



So, if the second term in square brackets is equal to 0, then the covariant derivative does 
in fact transform properly. With that in mind, let's look at the second term in the square 
brackets: 



2^S 



dy" dy"' d x 
dxP dx"' dy"'dy" 



dy" dy"' d dx^ 
dxt^ dx"" dx"' dy" 
dy" d'^x^ 
dxl^ dx'^dy" 



(4.2.54) 



^For notational simplicity we are taking everything with a tilde to be in i/ coordinates and everything 
without a tilde to be in the x coordinates. 

^We will be making extensive use of the identity 



(4.2.52) 



etc. 
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Now, starting with (4.2.52), we can find the identity 



d_ 
dx'- 



dx^ dy^ 
dy^ dx^ 



(9x* d y ^ dy 



d'^y'^ 
Qyk Qx^dx^ 



Plugging this into (4.2.541 we have 



dx^ dx^dy'' 



dy'' dx'-dx^ dx^ dx'-dy'' 

Qyk Q2^t 

dx^ dx^dy'' 



dx^ d'^y^ 
dy'y dx'^dx'^ 



(4.2.55) 



(4.2.56) 



which exactly cancels the first term in the square brackets in (4.2.53 1. So, we finally have 

VfiW = Vuw (4.2.57) 

The covariant derivative is in fact covariant. Neither the partial derivative nor the con- 
nection transform in a covariant way, but when they are added together, the non-linear 
parts exactly cancel making the sum covariant. 

4.2.4 Other Connections 

Admittedly there is something a bit arbitrary about how we defined "parallel trans- 
port" in section 4.2.1 We said there that a vector is parallel transported along a line in 
such a way that it maintains its direction in M^. While this may seem like the only (or 
at least the most) sensible way of defining it, keep in mind that we are ultimately inter- 
ested in much more general manifolds than M^. Before generalizing again to an arbitrary 



manifold (as we did in sections 4.2.2 and 4.2.3 ), let's consider an alternative connection in 

Rather than demand that a transported vector keep its direction constant in M^, let's 
say (for example) that the transported vector retains the angle it makes with the radial 
vector: 
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We can work out the details of this in a similar way as we did to get (4.2.101, ( 4.2.12[ >, 
(4.2.13), (4.2.14). Rather than work through the details, we merely cite the results (you 
should work this out to verify on your own). Starting with Cartesian coordinates, moving 
a small distance in the x direction {x ^ x + Sx) gives 



u 



u"" + 6x 



y 



6x 



y 



r^2 _|_ y2 

And moving a small distance in the y direction {y ^ y + Sy) gives 



u 



u 



6y- 



X 



+ 5y 



x^ + y^ 

X 

x^ + y'^ 



So, in the same way that we got (4.2.44), we now have 

pi -nx y 

^ 12 

11 



xy 



XX 



x^ + y^ 

y 



-L 9 

r 



22 

2 
21 



pa; 

yy 



yy 



rj.Z _|_ 
X 

^2 _j_ ^2 



(4.2.58) 



(4.2.59) 



(4.2.60) 
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and all the rest are 0. 

Now we want to find the connection coefficients in polar coordinates]^ To do this we 
use (4.2.51 1. Again, we encourage you to work this out on your own, but we merely quote 
the results. Under r ^ r + Sr, 



u 



u 



(4.2.62) 



And under 



+ 



u 



u 



u'^ + -6^u' 
r 



So the only non-zero connection coefficient is 



(4.2.63) 



(4.2.64) 



The important thing to realize is that this connection is just as good as any other. The 
connection doesn't explicitly say anything about the space - rather the connection defines 
our arbitrary convention for moving vectors around to take derivatives. We get to make 
these rules up however we want - "parallel" can mean anything we want it to mean. 

We will however see soon that on manifolds with metrics there is one connection that 
has a special meaning. We will discuss this later. 

4.2.5 Geodesies 

As a brief review, recall that when we started our exposition of calculus on manifolds 
in section 4.2.1 part of our fundamental problem was that there is no natural way of 



comparing vectors in TpAi to vectors in TgAi for p ^ q. This ambiguity in comparing 



vectors in different tangent spaces led to the non-tensorial transformation law (4.2.2 1. We 



amended this problem through parallel transportation (cf . page 183 ff) - a way of defining 
how a vector at one point is to be moved to a vector at another point. Then the transported 
vector can be compared to the actual vector at the new point and a derivative can be de- 
fined. We called the "instructions" for how to transport a vector the "connection" (cf page 
187 and section 4.2.2). We considered two examples of connections in M^: the trivial con- 



m section 



4.2.4 



nection (4.2.45) (which was ( 4.2.44[ )) in polar coordinates) and the non- trivial connection 



The first of these examples lines up with our intuitive notion of "parallel" 
transportation, where a vector doesn't change direction as you move it around. However, 



''One might be tempted to merely transform them directly to polar coordinates, for example 



cos^ I 



sin^ ( 



(4.2.61) 



However this would merely be the polar coordinate representation of the change of an x direction basis 
vector in the y direction. To find the change of, say, an r direction basis vector in the 9 direction, we must use 
the standard transformation law for the connection coefficients, equation ( 4.2.51| . 
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as we pointed out, the way we parallel transport is ultimately arbitrary. Keep in mind that 
for the more general manifolds we will be interested in later, we don't have the luxury 
of being able to trust our intuition about things. There is nothing natural which says 
that (4.2.441) (keeping their direction the same) provides a better way of moving vectors 



around than the connection in section 4.2.4 (keeping the angle with the radial vector 
the same). 

So what does a connection, or a specific set of instructions about how to parallel trans- 
port, tell us? Recall that the point of parallel transportation is that it provides a way of 
moving a vector around a manifold "without changing it" (page [T83] >. Or, in the language 
we have learned to this point, a vector u is parallel transported in the direction of a vector 



V if the covariant derivative of u in the direction of v vanishes (cf equation (4.2.15 1). This 
(obviously) means that if we start with u and move in the direction of v, then u won't 
change - the actual vector at any new point will be equal to the parallel transported vec- 
tor at the new point. Of course, by "won't change" we mean according to our arbitrary 



definition of what it means for something to not change (notice that (4.2.15) depends on 
the connection - we can make it anything we want by making our connection anything 
we want). 

To be more explicit, we may start with a vector u{p) (that is part of a vector field) at 
point p E J^. Then move to some point q E M. that is in the v direction from p, parallel 
transporting \i{p) with you using whatever connection you have chosen. Then, when 
you get to q, find the actual value of u(g), and compare it to the \i{p) you have parallel 
transported with you. If they are the same then the covariant derivative of the vector field 
u vanishes in the direction v from p to 

This discussion allows us to define an important idea. Imagine that the vector field 
u(p) is defined along a curve q{T) : [a,h] ^ M m M (it can be defined elsewhere on M as 
well - we are only interested in the definition along the curve for now). Let's take u(g(r)) 
to represent the motion of some particle moving on AA, where u(g(r)) is the particle's 



velocity at the point g(r) G M, and therefore (cf section 2.3.9) 



u(g(r)) = ^ (4.2.65) 

(where q(r) is simply the coordinate representation of ^(t)). 

By both physical reasoning and mathematical reasoning, we know that if left alone a 
particle will simply move in the straightest line it can. In other words it's not going to 
change its velocity for no reason and it's going to follow some extremum path in going 
from one point to another. In other words, its velocity vector u(g(r)) will be parallel 
transported at each point, in the direction it is moving. We can capture this more precisely 
by saying that the particle will travel along the path represented by u(5(r)) if 

u'ViU = VuU = (4.2.66) 



^We are, of course, assuming that the path from p to q is infinitesimal - our language is relaxed for 
simplicity. 
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Using the definition of the covariant derivative (4.2.39) to get 



dq^ d dq^ • dq^ dq'' \ d 



dr dx^ dr ^'^ dr dr J dx^ 

(4.2.67) 

The values q^ are simply coordinates, so the first term above can be simplified resulting 



m 



d dq' ■ dq^ dq'' \ d 



which will vanish in general for 

d'^q^ j dq^ dq 



for every i. So, (4.2.691 is a differential equation for q(r), and the solutions will be a 
curve in M. which gives rise to a vector field that is parallel transported along the path 
g(r). Furthermore, this path will be the equivalent to an extremum path for the particle 
to follow - or put less formally, the path that corresponds to the "straightest line" along 
M.. We call such paths geodesies. They are the natural path a particle will follow when 
moving through M., given some particular connection. 

Of course, as we have indicated several times, this means that the "straightest line" the 
particle will follow is completely dependent on the connection. To see a few examples of 
this, let's once again consider the familiar M?, starting with the trivial connection (4.2.451. 
Because all of the connection coefficients vanish this will simply giv^ 

d^x 



dr^ 
dr^ 







(4.2.70) 



Which has the straightforward solutions 



x{r) = Ar + B 

y[r) =Cr + D (4.2.71) 

where A, B, C, D are simply constants of integration. Obviously the geodesies in this case 
will be straight lines. 



^We are switching from q*, the coordinates of the points of the curve, to simple coordinates - nothing is 
lost in this switch because the g' are nothing more than coordinates to begin with. 
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We can set up the equation with the same connection in polar coordinates (4.2.441, 
getting 



'^,+-f4, = (4.2.72) 
This is harder to solve, but you can show that the solutions are of the form 



r{t) = ^/{AT + Bf + {CT + Df = ^{q-y + [qv] 



2 



tan-^f?^')=tan-f-) (4-2.73) 



<p(t) 


1 




r 








= 


d^ 



^At + BJ Vg^ 

Graphing these will once again give straight lines in M^. So, we have shown that the 
straightest lines in given V^-^^ = (in Cartesian coordinates) will be straight lines. 

But what about the connection in section 4.2.4 ? In this case the Cartesian equations 
are very difficult to solve, but the polar equations are easy: 

(j)r = 

(4.2.74) 

We can solve the second easily: 

r{T) = At + B (4.2.75) 
Then, plugging this into the first we get 

0(r) = ^CAr^ + CBr + D (4.2.76) 

There are several classes of solutions to this, depending on the values of A, B, C and D. 
We provide a few examples. For A = B = D = \ and C = we get straight radial lines 
from the center: 
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For A = and B = C = D = 1 we have circles 




So by changing the connection we have radically altered the geodesies, or "straightest 
lines" in this space. The physical meaning of the connection is likely not clear at this point. 
As we have defined it, the connection is nothing more than a precise statement of our 
(arbitrary) convention for moving vectors around in order to take a covariant derivative. 
It is therefore not obvious why such a convention would have such radical geometric 
effects. 

While we will explore this in much greater detail later, we briefly address it now. 
The point is in the fact that a parallel transport moves the vector "without changing it". 



Considering the connection in section 4.2.2 on one hand we can think of this as making 
things move in a strange way around the normal flat by changing the geodesies. A 
physical analogy for this is in electromagnetism. Consider some section of flat spacetime 
with a non-zero value for the electromagnetic field. There is no curvature, and therefore 
the spacetime geodesies are straight lines. However a particle that carries electric charge 
will not move in a straight line through this space - the geodesies it travels on are altered 
by the presence of the field. In other words the metric is unchanged as in the example 
here (the metric in our copy of in this section is the flat metric), but there is an addi- 
tional connection added to the space that has nothing to do with the metric, and therefore 
particles that "see" this connection follow non-straight line geodesies. 

As one final comment, consider the geodesic equation ( 4.2.69[ > (rewritten in terms of 
the coordinates rather than a path g*) 

„ d^x^ ^. dx^ dx^ 

Vuu=^ + /^1^^ = (4.2.77) 
Let's consider an arbitrary reparameterization 

r — . A = A(r) (4.2.78) 

Under this, 

ar dr dX dX 
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And 



X' 



d 



X' 



d 



(A 



r/2 



d 



dX J \ dX J dX^ dX 

So under this transformation the geodesic equation becomes 



(4.2.80) 



+ r 



dx^ dx^ 







(A 



dX^ 

d'^x^ 



+ r 



dX 

dx^ dx^ 



+ (A')'n 



dx^ dx^ 



dr dr 

dx^ 



~dX^ 



+ r 



dx^ ' ^ ^■'^ dX dX 

dx^ dx^ X" dx^ 
^^ItXltX ^ ~J'~dX 



-X'X" 



dX 



If we set / = — ^ then this is equivalent to 



VuU = fu 



(4.2.81) 



(4.2.82) 



rather than our original constraint (4.2.66 1. The difference is that (4.2.82 1 demands that the 
vector be transported in such a way that it is always parallel to itself. Equation ( 4.2.66[ > on 
the other hand demands that it be transported along itself without changing speed. The 



solutions to (4.2.66) and (4.2.82) will be the same paths, but with different parameteriza- 
tions. If we eventually want to take the parameter to be time it will be helpful to choose a 
parameterization which leads to ( 4.2.66[ ) instead of ( 4.2.82[ ) 



4.2.6 Torsion and the Relationship Between VuV and C^v 

Before moving on to consider the relationship between the connection and the geome- 
try of a manifold in greater details, we consider a few aspects of connections that we have 
only considered implicitly so far. We now make them explicit. 



2.3.10 



First, let's briefly review what we said about Lie derivatives in sections 2.3.9 and 
The essential idea of a Lie derivatives is that, given two vector fields, we can 



10 



take the derivative of one (at a point) in the direction of the other. Because we were talk- 
ing about about infinitesimal displacements, there was not a problem in talking about 
moving from a point p G to a point q{e,p) via the infinitesimal displacement due to a 
vector: 



(4.2.83) 



(cf equation ( |2.3.114| )). 

Recall that the idea behind a Lie derivative is to use the flow induced by u to 
provide a tangent mapping for v - it was through this tangent mapping that we were 
able to compare vectors in different tangent spaces. The geometrical interpretation of the 
Lie derivative was then a statement of the failure of a rectangle made from two different 



section 2.3.10). 



paths to close (cf picture on page 78 and surrounding discussion, including discussion in 



10 



You are encouraged to reread those sections before moving on. 
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Now let's consider how the Lie derivative relates to the covariant derivative. First we 
write out both forms: 

f -dv' ^du'\ d 

Surely there must be some way of relating them to each other. The similarity is the pres- 
ence of partial derivatives. In C^v there is an antisymmetrized pair of partial derivatives, 
so a natural guess may be to try an antisymmetrized covariant derivative: 



u' — — : — V-' 



■A 

dx' 



C^+{q,-ri^v^v'^ (4.2.85) 



So, if the connection is symmetric (F^ ^^ = F^^), then 

VuV - Vvu = C^w (4.2.86) 
But when the connection is not symmetric we have the relation (in components) 

V^v' - - C^v' = (r;.fc - ryu^v'^ (4.2.87) 

So what is the meaning of this antisymmetric part of the connection? To see it, consider 
a point p E M with a geodesic passing through it in the 

-^'li (4.2.88) 

direction (e is an infinitesimal vector) p] 

^^The picture is misleading because we have drawn e with a finite length, rather than an infinitesimal 
length. Think of the e vector at p that is drawn as merely pointing in the correct direction - the actual vector 
does not have the finite length that the picture indicates. 
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Now consider another point q separated from p by a small distance 5, or in other words 
separated from x(p) by a small displacement vector 

\ 




Now parallel transport e to g along 5, getting 

T 

jk 



e'* = e' - r^.fc(5^e'= (4.2.89) 



^^The vector 5 will act as a displacement vectors in the exact same way as v^'^ did in equations | |2.3.127 l 



and | |2.3.129 1. Again you should think of 8 as being infinitesimal. We cannot draw an infinitesimal vector. 



so keep in mind that the points p and q are only separated by a very small distance. 
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Of course the vector e' — F^j^.^-'e'^ at q is exactly parallel to at p by definition because we 
parallel transported it. We drew it in a way that indicates that it may have twisted in some 
sense when it was transported, but that doesn't matter - the notion of parallel is defined 
by the connection. The vectors and e* — F'^.^^e'^ should be considered parallel. 
Now consider a geodesic through q in the direction of e'. 




We can parallel transport 5 along the geodesic through p. If we parallel transport it only 
the infinitesimal displacement e, we get 

6'' = 6'- Ti^e^S'' (4.2.90) 
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As mentioned above, these vectors are infinitesimal, and therefore their finite appearance 
in the above pictures is misleading. Because they are infinitesimal, we can redraw this 
more accurately as 




So, as the vector S starts off attached to the geodesic through q. However, as it moves 
along the geodesic through q, the two geodesies (through q and p) begin to twist away 
from each other, resulting in the parallel transported S becoming detached from the 
geodesic through q. The degree of separation is given by the vector T in the diagram. 
To find the exact form of T, we subtract the vector from p to B from the vector from p to 
A. From p to A is given by 

x\p) + e' + 5" = x\p) + e' + 5' - V'e^'' (4.2.91) 
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and from p to i? is 



x\p) + S' + e" = x\p) + 5' + e'' - T};- 5^e^ 



(4.2.92) 



And the difference between them is 



jk 



(4.2.93) 



So, Tj^j = represents how much one geodesic twists away from a nearby geodesic. 

For this reason we call Tj^, the torsion tensor. If two nearby geodesies stay near each other 
along the entire geodesic, then there is no twisting towards or away from each other, and 
the torsion vanishes. A connection in which the geodesies have this property is said to 
be torsion free. Such a connection is symmetric. A non-symmetric connection will have 
torsion. Also, obviously the torsion tensor is totally antisymmetric: 



t: 



jk 



-T, 



kj 



(4.2.94) 



Also, notice that Tjf^ is the exact expression we found on the right hand side of (4.2.871 
above. So finally, we have the relationship (in components) 



(4.2.95) 



Or 



(4.2.96) 



We can take (4.2.95 1 (and/ or (4.2.96 1) as the definition of torsion. 



Recall from section 2.3.9 that the Lie derivative defined the non-closure of a rectangle 



(cf page 78 ). The idea is that if you start atp & M. and take two infinitesimal displacement 
vectors e and 6, you can transport each along the other in two ways: the tangent mapping 
as with the Lie derivative, or with parallel transport as with the covariant derivative. De- 
noting a parallel transported vector with a 1 1 subscript and a vector that has been tangent 
mapped with a m subscript, we have 
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Equation (4.2.95 1 can simply be read off of this as a simple vector diagram. 

We can look at Tjj^ in yet another way. Looking again at the equation for geodesies 







(4.2.97) 



dr2 ' ^i^dr dr 

As with any tensor, we can break F* up into a symmetric and and antisymmetric part: 











- ^i^k 


■pi 


= ^i^k 


pi \ ^ rpi 


pi 


= ^jk) H 





(4.2.98) 



So, ( |4.2.69| > becomes 

d^q' 



+ r:. 



(ig-' dq^ 



dr dr dr 



d^(l' . (r^ . pi N d<l' 



(ir 
d^q' 



dr dr 

dq^ dq^ 



,pi dQ^dq^ -r. 

dr ^'^^ dr dr ^^^'^ dr dr 
■ dq^ dq^ 1 , dq^ dq'' 



dr dr 2 ^'^ dr dr 



(4.2.99) 



The last term in the third line will vanish because it is a sum over all antisymmetric in- 
dices. This tells us that it is only the symmetric part of the connection that contributes to 
the geodesies. 
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In other words, for a given connection, the geodesies, or "straightest possible lines" 
are defined entirely by the symmetric part. The antisymmetric part plays a different role 
- namely that of torsion. The antisymmetric part defines how much the geodesies twist 
relative to each other. We can picture this as 




To see this, consider the two connections from sections (4.2.2) and (4.2.4), given (re- 
spectively) by 



r<f> 
-pr 



cf>r 

-r 



1 

r 



and 



(4.2.100) 



(4.2.101) 



(all other connection coefficients in both vanish). The first is obviously symmetric, so 
there is no antisymmetric part at all. On the other hand, for the second we have 



{<t>r) 



2 (^<t>r + ^rtj}) 



r<j>J 



1 

r 
1 
r 



1 

'2r 
1 

'2r 



{r<t>) 

_Y<t> 



(4.2.102) 
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So for the first one there is no torsion. This makes sense because the geodesies are 
simply straight lines, which we can plainly see don't twist relative to each other. 
On the other hand, the second one does have torsion given by 



1 

r 



(4.2.103) 



By studying the geodesies given in section 4.2.4 you can see that they do indeed twist 



away from each other as you spiral out from the center. 

4.2.7 The Metric Connection 

We started chapter |4| "Manifolds with Metrics", with a discussion of metrics, which 
is not surprising. But once we got to section 4.2.1 you may have noticed that we have 



hardly mentioned the word "metric" since. We mentioned at the beginning of section 4.2.1 
that metrics are still our ultimate focus, and that has remained true. Now that we have 
spent an adequate amount of time discussing connections, we are finally in a position to 
tie metrics back in. 

We pointed out when we introduced connections that a connection can be anything 
- it is entirely up to us to define parallel transport however we want. We exploited this 
freedom to define a nice and "expected" connection which led to straight line geodesies in 
section 4.2.2 and then arbitrarily define a completely different connection in 4.2.4 which 



led to much more exotic geodesies (straight lines, circles, and spirals). However at the end 



of section 4.2.4 we mentioned that if a manifold has a metric, then there is a particular 



connection (or rather class of connections) that is special. Such a connection is said to be 
a "metric compatible", or a "metric compatible connection". 

Our approach will be to assume a manifold Ai comes with a metric gij, and then use 
the metric to put certain conditions on F^^.. Recall from section 



4.1 



that the point of a 



metric is to provide a map from two vectors, say u and v, to the reals: 



of the (component) form 



9-TpM 



(4.2.104) 



(4.2.105) 



(cf equation (4.1.41). Let's say we start with u and v in TpAi with inner product g{u, v) 



gijV^uK We can then choose some other arbitrary vector w in T^M.. There will be some 
geodesic in the direction of w through p. 
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The condition we will impose will be that the inner product does not change as you par- 
allel transport along the geodesic. 
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So, because we are parallel transporting, by definition we have 

Vwu = Vwv = (4.2.106) 

To preserve the inner product along the geodesic, we must have 

= Vw((?(v,u,)) 
= Vw(fi'ji^^V) 

= w^v'u^VkQij (4.2.107) 
Because we want this to be true for any vectors u and v, we must therefore demand 

VkQij = (4.2.108) 
Using equation ( 4.2.42[ >, we can write this as 

^ 99ij 



Vkgij = 



kiymj -L kjiJ' 







(4.2.109) 



If (4.2.1091 is satisfied by V-^^, then the connection is said to be a metric-compatible con- 
nection. 

Next we can exploit the fact that gij is symmetric and write all three {k,i,j) cyclic 
permutations of ( 4.2.109[ >: 



dgjk 







dg, 



J- ij ymk -L ikUmj — U 



ki 



Q^j ^ jkymi ^ jigmk — U 

Now take minus the first one plus the second plus the third (trust us): 



(4.2.110) 







+ 
+ 



dgij 
dgjk 



_ r™n ■ — r™n 

kiilrnj ^ kjiJr, 



g^k ^ Hi^rnj - kjiJmi 

_ pm _ 

ijUmk J- ikymj 

— n — V"^n 
^ jkyrni ^ jiymk 



dx^ 
dgki 



dx^ 
dgij ^ dgjk 



dg. 



ki 



dx^ (9x* dx^ 

+ — ^Tkjdmj + ij^^j — ^%)gn 

-{VTj + ^%}9mk 

dgij ^ dgjk ^ dg^ 
dx^ 9x* dx^ 

'^'^ki 9m j + Tf^j 9 mi 
— '^^^j)9mk 



(4.2.111) 
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We can solve (|4.2.111|) with 



dgjk ^ dgki 



dg: 



kn 



1 



kn 



-.9 



kn 



2^ 

dgjk 



+ 



dx^ 
dgjk 
dx^ 

dgki 



I] I rpm I rpm. 

' ki yni] ^ -t fcj i/mj 



+ 



dgjj 

dx^ dx^ 



qJ, + 2 ^^^^^"^^ ^ '^kj9mi) 



(4.2.112) 



But in general, we have 

■pn 



■pn I pn 



kn 



-,kn 



\9 



kn 



dgjk 


+ 


dgki 


dgij 


dx^ 


dx^ 


dx^ 


dgjk 


+ 


dgki 


dgij 


dx^ 


dx^ 


dx^ 


dgjk 


+ 


dgki 


dgij 


dx^ 


dx^ 


dx^ 



-r 



jk) 



(4.2.113) 



The second term with the sum of three torsion tensors is called the contorsion tensor. 

If we choose a connection that is symmetric, however, the contorsion tensor vanishes. 
In this case we are left with 



-pn 



-.9 



kn 



dgjk _^ dgki 



dg, 



dx^ 



dx^ dx^ 



(4.2.114) 



If a connection satisfies (4.2.114) (implying that there is no torsion), then it is called the 
Levi-Civita Connection. We will see that Levi-Civita connections play a central role in a 
tremendous amount of physics. In fact, general relativity is specifically a theory of Levi- 
Civita connections. 

As a brief comment, the Levi-Civita connection coefficients are also called the 
Christoffel Symbols in a great deal of general relativity and differential geometry lit- 
erature. 

Another way of defining a Levi-Civita connection is that it is a metric compatible con- 
nection in which the torsion vanishes. Notice that if a connection satisfies the metric com- 



patibility constraint (4.2.109 1, there is only a constraint put on its symmetric part ((4.2.109 1 
lead to (4.2.1121). Therefore a connection can be metric compatible (its symmetric part sat- 
isfies (4.2.1121), but it has an antisymmetric part, making it not a Levi-Civita connection. 
Also, a symmetric connection may not necessarily satisfy ( |4.2.112 >, making it symmetric 
but not metric-compatible. Therefore a connection may be symmetric but not Levi-Civita. 
However, a Levi-Civita connection is by definition symmetric and metric-compatible. So, 
we have the following Venn diagram of connections: 
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connections 




For the next large part of these notes whenever we talk about a connection we will 
mean a Levi-Civita connection. Eventually we will generalize to talk about non Levi- 
Civita connections, and we will announce that change at the appropriate time. But until 
further notice, any connection can be assumed to be Levi-Civita, and therefore given by 
( I4.2.114D . 

As a few examples, consider the metrics we found in section 4.1 We had in Carte- 
sian coordinates: 



9ij 



You can plug this into (4.2.114) and get 



^ jk 



1 
1 



Oy i,j,k 



(4.2.115) 



(4.2.116) 



which agrees with what we found in (4.2.45 1. 

Leading up to equation (4.1.341 we found that the Cartesian metric on transforms 

to 



9ij 



1 
r2 



(4.2.117) 



in polar coordinates. You are encouraged to work out (4.2.114) for this on your own, and 
you will see that indeed the non-vanishing components are 



r<f> 
J- j^j. 



<f>r 

—r 



1 

r 



(4.2.118) 
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as in ( |4.2.44 

You are encouraged to work out on your own the fact that the Minkowski metric (in 
3+1 dimensions) 



9ij Vij 



/-I 0\ 

10 

10 

\ ly 



(4.2.119) 



gives 



Next is the metric for a circle (4.1.51 1 

(it is only 1x1 because a circle is 1 dimensional). This will give 

r}, = 0V2,j,A; 



Less trivial is the ellipse metric (4.1.571 

9ij ~ ''^^(4 '^os^ 9 + sin^ t 

which gives 

, 3sin(2g) 
5 + 4cos(20) 



(4.2.120) 



(4.2.121) 



(4.2.122) 



(4.2.123) 



(4.2.124) 



(again, the ellipse is only 1 dimensional so there is only one value each index can take 
the 6 value). 



Next is the metric for S give in (4.1.621, 



9ij 



1 
sin^^ 



(4.2.125) 



This will give non vanishing components 

ye _ 

^ 94, ~ 

Or there is the deformed S"^ in ( 4.1.64| , 



— sin 9 cos 9 
Vt = cot 9 



9ij 



^((A2 + l)-(A2-l)cos(2^)) 
sin^ 9 



(4.2.126) 



(4.2.127) 



^^Notice the consistency here. We found that the metric 14.1.29 1 transformed as a tensor to give 14.1.34 1 in 
polar coordinates. Also we know from 14.2.116 1 that Cartesian coordinates give you a vanishing connection, 
which is also what we found in 14.2.45 1. And, we can use the transformation law for a connection 14.2.51 1 



to transform ( |4.2.116[ into | |4.1.34| 7 
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Here the non vanishing components will be 



(A2 - 1) sin(2^) 



(A2 + 1) - (A2 - 1) cos(20) 

2 cos 6 sin 6 
~(A2 + l)-(A2-l)cos(2e) 

rt = cot e 



Notice that (4.2.1281 reduces to (4.2.126> for A = 1, as expected 



Finally we have the torus metric (4.1.701, 



_ 
~ ^ (i? + r cos 



This will give non-vanishing components 

{R + r cos 6) sin 



r sin 6 



{R + r cos ( 



4.2.8 Metric Geodesies 



(4.2.128) 



(4.2.129) 



(4.2.130) 



Before concluding this section we consider one final idea. In section 4.2.5 we consid- 
ered geodesies, or the "straightest possible path" in the manifold. We said there that a 
geodesic is the path a particle will follow if not acted on by any other forces. We now 
revisit this idea. 

Consider two points on a manifold Ai, p and q. We want to find the path between them 
that extremizes the distance. If we assume that Ai has a metric then the idea of distance 
is well defined. Specifically, the infinitesimal distance between two points is given by 



ds = ^/gijdx^dx^ 



(4.2.131) 



Generalizing this, the distance I from p to q will then be the integral from p to g of 
some path, where we can parameterize the path x*(r) by the parameter r: 



dx^i^r) dx^i^r) 
dr dr 



(4.2.132) 



Notice that X doesn't depend on the parameterization (i.e., if we change the parameteri- 
zation , r t'(t), the integral remains identical). 

This is simply a calculus of variations problem, and we therefore find the extremized 
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path by demanding that the first order term of the variation of the path vanish 

51 



15 







5 



« ^ dx' dx^ \ 2 
9ij 



9ij 



9ij 



dr dr 

dx^ dx^ 
dr dr 

dx^ dx^ 
dr dr 



dr 



dx^ 



dx^ dx^ 



dr dr 



d{6x^) dx^ dx"" d{6x- 



dr dr 



+ 9ij 



dr dr 



dr 



6x' 



. dx^ dx^ 
dr dr 



dx^ d{6x^) 
' dr dr 



dr 



(4.2.133) 



Because (4.2.132 1 is independent of the parameterization, we can now choose our param- 
eterization so as to fix 



without loss of generality. So we are left with the requirement 



(4.2.134) 







dg 



dx'' 



6x' 



, dx^ dx^ 
dr dr 



+ 2g. 



dx^ d{6x^) 
dr dr 



dr 



We can integrate the second term by parts and with appropriate boundary conditions 
ignore the surface term. So, 







dgij j^dx^ dx^ 

Jb 



dx^ 



ox'^ 



dr dr 
dx^ dx^ 
dr dr 







dx' 




g^j 


dr 


d^x"" 


5x-' 


dr^ 



dgij ^ J, dx^ dx^ 



dx^ 



dr dr 



2g„^5x^ -2 



6x-'j dr 

-^^-i^'^SxAdr 
or dr J 

dgij dx^ dx^ 



6x'' 



dgij dx^ dx^ 
dx^ dr dr 



dr^ 

d'^x^ 
2*.^ - 2 



dx'' dr dr 
dgik dx^ dx 



6x^ I dr 



dx^ dr dr 



dr 



14t 



We assume the reader is familiar with variational calculations such as the one here. If you are not we 
strongly encourage you to spend some time reviewing such problems. 

^^Keep in mind that the metric is position dependent {gij — gij{x^{T))) and therefore the variation of the 
path includes a variation of the metric. Furthermore, because the metric is a function of x^, we will use the 
chain rule to take the variation of the metric: 



^Sx' 
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So SI = as long as (dividing by 2 to make the result more transparent) 



9ik 



(fx^ ( dgik 1 dgij \ dx^ dx 



dr'^ 



+ 



d^x^ 
~d^ 

~d^ 



+ 9 



nk 



+ 9 



nk 



d'^x''' 

~d^ 



nk 



ik 



+ 



dxi 

dgik _ 
dx^ 

dgik _ 
dx^ 

dgjk 



2 (9x*^ 

1 dgij 

2 dx'' 

1 dgij 

2 dx^ 

dgij 



dxi dx'^ (9x* 



d^x" 
~d^ 



dr dr 
d/OC"^ doc 
dr dr 
dx^ dx^ 
dr dr 
dx^ dx^ 
dr dr 
dx'dxi 
dr dr 



which is exactly what we had before in equation (4.2.69 1 in section 4.2.5 



(4.2.135) 



4.2.9 Normal Coordinates 

For a given metric, when working with the corresponding Levi-Civita connection we 
can choose a particular coordinate system which simplifies things greatly. 

Consider a point p E J^. If we take a very small neighborhood of p, there will be an 
infinite number of geodesies through p: 




Recall from the discussion at the end of section 

VuU = 
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4.2.5 



that the geodesic equation 

(4.2.136) 



not only specifies the path, but because the right hand side is 0, also specifies the param- 
eterization. For this reason, every geodesic at p specifies (and is specified by) a vector 
V G TpA^. In fact, the geodesies emanating from p are one to one with elements of Tp Ai. 

Consider a point q E M. that is very near to p. Because we have fixed the right hand 
side of the geodesic equation to be (thus specifying the parameterization of a given 
geodesic), the point q will specify one single geodesic starting at p, denoted qp,q{T) that 
satisfies (by definition) 

qpAr)l=, = Q (4.2.137) 




Now, using the one to one relationship between the geodesies emanating from p and 
tangent vectors in TpM, we can use this geodesic qp^qir) to specify a particular vector 
which corresponds (in coordinates) to qp,q{r): 



dr 



(4.2.138) 
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We can then take the vector v, or more properly the components of the vector v, to be 
the components of the point q. This type of coordinate system based on geodesies in the 
neighborhood of p is called the Normal Coordinate system of p, and is only valid in a 
neighborhood of p. 

If we denote the normal coordinates as x, then obviously x{p) = 0. From the discus- 



sion following equation (2.3.115 1, we can then define the coordinates of q (where q is near 
p) to be 

x(g) = e^^-^{p) = Vp,, (4.2.139) 



where v^^ is defined above. 



Then, an arbitrary point "between" p and q on the path qp,q{T) will have coordinates 

q,,,(r) = x(gp,,(r)) = rv^,, (4.2.140) 



(notice this easily satisfies both equations in ( 4.2.137[ )) 



So, within a sufficiently small neighborhood of p we can use normal coordinates to 
specify any point. Now consider the geodesic equation in normal coordinates x. It reads 

dr^ ^ dr dr 
^ d^rviJ diTV^diTV^_ 

^ ^kHJvj,y^^^ = (4.2.141) 
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(where we are stating the position dependence of the F*;, terms explicitly). The only way 
for this to be satisfied in general is for 



jk 







(4.2.142) 



for every z, j, k. In other words, we have shown that if we use normal coordinates the 
connection coefficients all vanish at p. Consequently the covariant derivatives all become 
partial derivatives. 

Also, by comparing (4.2.142 1 to (4.2.114 1, we can see that F*^ = implies 



digjk = 



(4.2.143) 



for every i,j,k. 

As a warning, these relationships are only true for p. Therefore, in these coordinates, 
there is no guarantee that the connection will vanish at any other point, and there is no 
guarantee that any derivatives of the connection will vanish even at p. 

We won't always be using normal coordinates, and you shouldn't assume we are in 
normal coordinates unless we explicitly say so. We will find them useful for several cal- 
culations, but because they only hold for a specific point (at a time), they are only useful 
to get information about that point. 



4.3 The Hodge Star and Integration 

Before moving on to discuss curvature, which is the primary point of this chapter, it 
will prove useful to first consider another important concept that will aid us greatly as 
we proceed. The ideas we will be discussing here are the first in an enormously rich and 
deep brach of mathematics. We will revisit these ideas many, many times throughout the 
rest of these notes and the rest of this series. For now we will content ourselves with a 
very superficial discussion. 

4.3.1 The Hodge Star 

Recall^ from (2.2.14) that the dimension of a given set of p-forms is 



di,n(A''A.)=(;)=^^^ (4.3.1) 

where n is the dimension of Ai. Using the standard identity from combinators for the 
binomial coefficient, 

-( " ) (4.3.2) 

p I \n — pi 



We have that 



dim(A*'A^) = dim(A"~PA^) (4.3.3) 



16- 



We strongly suggest you reread section|2.2|at this point 
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Specifically, the fact that dim(A^AI) = dim(A"~^A1) indicates that the number of basis 
covectors for M'AA is equal to the number of basis covectors for A"~^A^. We can therefore 
say that the basis for K^M. is isomorphic to the basis for K^'^^M.. When we first saw 



this in section 2.2 we didn't have the machinery to exploit this symmetry. Now that we 
understand metrics we are able to understand it more fully. 

The idea is that we want to define a map that takes a basis element in IsPM. to a basis 
element in A" ^'A^ in a well-defined way. We do this by using the Hodge Star Oparator 

^ : K^M A^'-PM (4.3.4) 

Given a p form 

uj = —^ijOi^i^...i^dx'^ A dx"-'' A ■ ■ ■ A dx'^ (4.3.5) 
we operate on uj with -k according to 



where we have used the metric to lower n — poi the indices on the purely antisymmetric 
tensor: 

iii2-ip _ ...n-i Mi2-ipkp+ikp+2-kr, C4 3 7^ 

But care must be taken here. We must be careful about the ordering between the upper 
and lower indices. We will take as our convention that the lower indices come in order 
after the upper indices. For example, we know that e^^"^ = 1 and e^^^ = — 1. If we assume 
that we are in three dimensional Euclidian space, the metric gij = 5ij, so 



So 



and 



Similarly, 



and 



and so on. 



e 



e',: = Skne'^^ (4.3.8) 



= 53^,12n ^ ^123 ^ ^ (4 3 9) 



6^3 = S^ySn ^ ^132 _ _^ (4.3.10) 



L = '^2n^53ne''"" = 6^'' = 1 (4.3.11) 



= <53m52ne'™" = e''' = -1 (4.3.12) 
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As a few examples of this, consider again working in with Euclidian metric. Let's 
start with the 0-form ^cu. The determinant of the metric gij = Sij is 5 = 1. In this case, 
n = 3 and p = 0. Starting with e^^'^ we need to lower all three indices, which simply gives 



eijk- So, equation (4.3.61 gives 



1 

0!(3~ 0)!^ 



-ku = 77^77:; — —^Loeijkdx^ A dx^ A dx^ 



= -uidx A dy A dz + dy A dz A dx + dz A dx A dy 
6 

—dx A dz A dy — dz A dy A dx — dy A dx A dz) 
= u dxAdy Adz e A^M^ (4.3.13) 

which is the natural guess for the 3 form in that a form corresponds to - there is only 
one of each so of course they map to each other. 

Next consider an arbitrary 1-form uj = ujidx^ = ujidx + uj2dy + uj^^dz: 

= - [ui{dx'^ A dx^ — dx^ A dx'^) + uj2{dx'^ A dx^ — dx^ A dx^) + uJs{dx^ A dx^ — dx^ A dx^)) 
= uJidx"^ A dx^ + uj2dx^ A dx^ + u^dx'^ A dx'^ G A^M^ (4.3.14) 

Next we look at the arbitrary 2-form = ^Uijdx^ A dx^: 

= 2 {^i2dx^ — uj2idx^ + u^idx'^ — ui^dx'^ + U23dx^ — uj32dx^) 

= ujudx^ + uj3idx^ + uj23dx^ G A^M^ (4.3.15) 

And finally, the arbitrary 3-form u; = ^cuijkdx^ A dx^ A dx^: 

3!(3-3)! ^ 

= 77(^^123 + "^231 + 1^312 ~ 1^132 ~ 1^321 ~ 1^213) 
D 

= W123 e A°M3 (4.3.16) 

On the other hand, consider M.^ with a Minkowski metric g^j = r]ij = diag{—l, 1,1,1). 
Now a 0-form will become a 4-form - you can work out the details yourself (the indices 
run from to 3, where x° = t): 



-kU! 



-ujT]air]bjVckVdi(^°''""^dx' A dx^ A dx'' A dx' 



0!(4-0)! 

= -udt Adx AdyAdz (4.3.17) 
You are encouraged to work out the rest in Minkowski space on your own. 
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As a few other examples in three dimensional Euclidian space, consider two covectors 
uiidx^ and aidx\ In equation (2.2.331 we commented that the wedge product between 
them looks like the cross product. Specifically, 

(dy A dz dx A dz dx A dy\ 
uoi LO2 (4.3.18) 

ai a2 as / 

Consider now the quantity A ol). You can write out the details yourself (expand out 
the determinant and then apply -k on each term): 

(dx dy dz\ 
uoi U02 (4.3.19) 
ai a2 as J 

which is the exact form of the cross product in three dimensions. 

You are encouraged to convince yourself of the following generalized relations in three 
dimensions, but we merely quote the results (we are denoting a p-form as uj(^p)). 

gradient : Vco'(o) = c?c<j(o) 
curl : V X uj^^ = i<duj{i) 
divergence: V-a;(_?) = i^d-kuj^^i-^ (4.3.20) 



The difference between the left hand sides and the right hand sides of (4.3.20) is that the 
right hand sides are unique to 3 dimensional Euclidian space, whereas the left hand sides 
are totally general for any arbitrary manifold. We will see how they allow us to generalize 
things like Maxwell's equations later. 

4.3.2 Invariant Volume Elements 

We make one final brief comment before moving on to discuss curvatur^^ Consider 
an n dimensional manifold A4. with coordinate functions x' for z = 1, . . . , n. At any given 
point p a basis for !s}TpM., the cotangent space at p, will be the set 

dx' i = l,---,n (4.3.21) 



We can then put these together to form a volume element as in equation (2.3.861. In 
Cartesian coordinates for this was 

dx Ady A dz (4.3.22) 

and in spherical coordinates for this was 

sin (t)dr AdO Adcf) (4.3.23) 

More generally we can write the volume form as 

hdx^ Adx'^ A-- - A dx"" (4.3.24) 



17 



It would be helpful to reread section 2.3.8 at this point 
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where h is some (scalar) function that makes the integral a legitimate volume form. It was 
h = 1 for Cartesian coordinates and h = r"^ sin (p in spherical coordinates. 

But, consider the transformation law for this - under the transformation from coordi- 
nate functions x' to coordinate functions y\ we have (cf equation (2.3.81 1) 

dx 



h{x')dx^ A ■ ■ ■ A dx" = h'{y') 



dy 



dy^ A-- - A dy''' 



(4.3.25) 



where the lines | | indicate a determinant. We left off the indices on the determinant term 
both for brevity and to emphasize that it is a determinant and therefore a scalar - it has no 
indices. You should note that the x and y do have multiple components and a determinant 
is being taken over those components. 
Let us make the permanent definition 



h{x' 



\9\ 



(4.3.26) 



where \g\ is the determinant of gij as expressed in the coordinate functions. The trans- 
formation law for gij is (denoting the metric in ?/* coordinates as gij) 



9ij 



Qya Qyb ^ 



dx^ dx^ 



J gab 



So the determinant 



Qya Qyb ^ 



dx^ dx^ 



gab 



Qya 



And therefore 



dx' 



dy 



dy'' 



dx^ 



dy 



dx 



dx 



And therefore the entire volume form we have defined transforms as 



(4.3.27) 



(4.3.28) 



(4.3.29) 



Adx^ = 


dy 


dx 


dx 


dy 




VW\dy^ 



\g\dy' A ■ ■ ■ A dy" 



(4.3.30) 



So it is indeed invariant. 

This is consistent with what we did above with ( 4.3.22| > and ( |4.3.23 >. The determinant 
of the Euclidian metric in Cartesian coordinates, g^j = 6ij, is obviously \/\g\ = 1, which is 
what we had. Then, from ( 4.1.36[ ), for spherical coordinates we have 



1 
det I r^sin^i 




sin^ - 



sin ( 



(4.3.31) 



which is also what we had above for spherical coordinates. 
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The purpose of this is that we now have a way of defining an integral over any (ori- 
entable) manifold ^A that is invariant under coordinate transformations. We will denote 
this invariant volume element the volume form fix, and if we want to integrate an arbi- 
trary function / over ^A, we define the integral to be 

f fQj^= I /v^rfxMx^-.-rfx" (4.3.32) 

Finally, consider the Hodge dual of the 0-form 1: 



★ 1 = ^^^^^ej^j2-j„dx^^ A dx^^ A ■ ■ ■ A rfx^" 

= \/\g\dx^ A rfx^ A ■ ■ ■ A c?x" (4.3.33) 
So we can rewrite the integral of an arbitrary function / over Ai as 

f^l (4.3.34) 

M 

4.3.3 Hodge Star Properties 

Consider a p-form u: = ^oji^i^.-i^dx''^ A rfx'^ A ■ ■ ■ A dx"^^ on an n dimensional manifold 
Ad with Riemannian metric. For simplicity we will take \/\g\ = 1. Taking the dual: 

^ u; = A dx^^- A ■ ■ ■ A dx^^ (4.3.35) 

Let's then take the Hodge dual of this: 



1 



_^^i^^n---v™.+r-.,.p^ . . . g^^^^g^^,^ . . . g^^^^dx^^ A ■ ■ ■ A dx'^ 

( — \\p{n-p) 



p\{n — p)\ 

(_1)P("-P) 

p\{n — p)\ 
(_l)p("-p) 

p\{n — p)\ 

(_l)P(n-p) 



\p{n-p) 



- A ■ ■ ■ A dx'^ 

uji^...i dx'^ A ■ ■ ■ A cix*^ 



ly^'^-P^uj (4.3.36) 



So T^TTir is the identity operator up to a sigij^ This is expected since -k takes a form of rank 
pton — p, and then n ~ p back to n — (n — = p. 



^^To see the 5*'* equality write out a few simple examples. 
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It is instructive to work out that on a pseudo-Riemannian manifold with a Lorentzian 
metric (one negative and the rest positive so that det(g) = — 1) we have 

(4.3.37) 

This makes finding the following identity maps easy: 

1 = -k -k Riemannian 

1 = Lorentzian (4.3.38) 

And therefore 

= (— ^ Riemannian 

= Lorentzian (4.3.39) 

Now consider two p-forms (on an n-dimensional manifold) uj and a. We discussed 



the wedge product of lj and ct in section 2.2 Obviously if p is greater than | then uj A a. 
will vanish identically. However, consider the quantity uj A i^cx: 



- ^ a;i,...i^«j,...,-^ei;-;^!'..,^ A • • • A dx'^ A rfx^^+i A ■ ■ ■ A 



(p\y(n — p) 



(p!)2(n-p)! '^■■"^ 

= (4.3.40) 

pi 

where, of course, ^Im is the invariant volume form on the manifold. Notice that had we 
swapped uj and a. in the above expression we would have gotten 

aA^uj = ^a^,...iy'■■■'^nM (4.3.41) 
p\ ^ 

And therefore 

UJ A i<a = a A i<uj (4.3.42) 
And because this is a well-defined n form, its integral 

u; Ai<cx (4.3.43) 



IM 

is well defined. It will prove useful to define an inner product between two forms accord- 



ing to (4.3.431. In other words, the inner product between an n-form a and an ra-form f3 
is 



{a,i3)= J aAi<i3 (4.3.44) 
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Now consider the exterior derivative of A ika where ck is a p form and is a p — 1 
form: 



d{u; Ai>ra.) = (rfcj) A -A-o; + (-1)^ ^u;A{di>:a.) 



(4.3.45) 



(cf rules for exterior derivative on page 60 1. Then (assuming this is a Riemannian mani- 
fold) we can insert the identity operator (first equation in (4.3.38 )). Noting that d * a is an 
n — p + 1 form, this gives 



(duj) A - A * d * a) 

(-l)"^'+"+^*ci*a = d^a 

Now we can write ( 4.3.46| > as 

d{(jL> A -ka) = {d(jL>) Aika — u; A {-kd'^cx.) 



Now define the operator 



where 



(4.3.46) 



(4.3.47) 



(4.3.48) 



(4.3.49) 



Finally, integrate d{u: A ^cx) over a manifold without a boundary dAi = 0. By Stokes 
theorem this will vanish. So, 

/ d{uj A -ka.) = / {{duj) A-kcx-uj A {-kd^a.)) 
Jm Jm 



(jj A i^a. 



dM 



And therefore 



/ {duj) Ai^a = uj A {'kd'^OL) 
Jm Jm 



(4.3.50) 



(4.3.51) 



This result will be useful later. 

Looking briefly at the operator ( 4.3.48[ >, which is called the Adjoint Exterior Deriva- 
tive, consider what it does to a p-form. The -k takes the p-form to an n — p form. The 
exterior derivative then takes the n — p form to an n — p + 1 form. The next ★ then takes 
the n — p + 1 form to an n — (n — p + 1) = p + 1 form. So, 



S : K^M — ^ NP+^M 
Finally consider the square of d"^: 

So d'^ is, like d, nilpotent. 



(4.3.52) 
(4.3.53) 
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4.3.4 The Hodge Decomposition and Harmonic Forms 



Before moving on, we will plant a few seeds for ideas that will be significant (and 
remarkably powerful) later in this series. We discussed cohomology in some detail pre- 



viously (section 3.3 ). Recall that the "point" (so to speak) of cohomology stems from the 



nilpotency of the exterior derivative: 

d"^ = (4.3.54) 

We used this to draw a distinction between forms that are exaclj^ and consequently 
closecQ trivially, and forms that are closed but not exact. We saw that the existence of 
forms that are closed but not exact indicates a "hole" of some sort, or in other words that 
the manifold is topologically non-trivial. 



In the previous section we defined a new differential operator, d'^, and in (4.3.531 
showed that d'^ is nilpotent. This leads us to consider the cohomology of d'^, which we 
from now on call the co-exterior derivative. 

Following what we did with the cohomology of d, we define a form as coexact if it 
can be globally written as the co-exterior derivative of another form. In other words, u; is 
coexact if there exists a form a such that 

= Sol (4.3.55) 
holds globally]^ We then define a form as coclosed if 

Suj = (4.3.56) 



Then, just as in section 3.3 we call the set of all coexact n-forms B^^ (M) and the set of 
all coclosed ra-f orms Z"^^ (7Vl)j | This allows us to define an alternative cohomology group 

{M) ^ Zl {M)IEI (M) (4.3.57) 

The "meaning" of this construction of cohomology is similar to the meaning of the d coho- 
mology of section 3.3 and we won't delve into the details herej^ other than to point out 



that d mapped n forms ton + 1 forms, and d'^ maps n forms to n — 1 forms, which changes 
the direction of the cohomology. 

However, there is an important relationship between exact forms and coexact forms. 



We said in equation (4.3.44) that the inner product between two n-forms is given by 

(cK,/3) = aAi.f3 (4.3.58) 



^^A form u) is exact if it can be written globally as the exterior derivative of another form: lj — da. 
^'^A form a; is closed if its exterior derivative vanishes: du) = 0. 

Again, the requirement that this be global is extremely important. 
^^From now on we will use a d or d^ subscript to specify whether we are doing d cohomology or d^ 
cohomology. 

^^We will later in a much more general setting. As you can imagine we are skimming the peaks of 
enormous mountains here. 
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So consider the inner product between a closed form an a co-closed form: 

{dct, d^l3) = J dctA *d^/3 (4.3.59) 



where clearly ck is an n — 1 form and /3 is an n + 1 form. Now according to ( 4.3.51[ > we can 
rewrite this as 



{da.,d^l3) = j dcxM<d^d^(3 



da AO 

= (4.3.60) 

So, exact forms and coexact forms are orthogonal to each other. This tells us that we can 
decompose an arbitrary form into a part that is exact, a part that is coexact, and something 
else that is neither exact nor coexact but thus far undefined. In other words, if we define 
the set of all exact forms as B'^{M) and the set of all coexact forms as B"^^ (M), 

A^M = B2iM)® B^,{M) © (?) (4.3.61) 

where we have left the final part a question mark because we assume that there must exist 
forms that are neither exact nor coexact, but we don't know what kind of forms they are. 
So, we want to find the set of forms that are orthogonal (according to the inner product 
(|4.3.44[)) to both B^{M) and B^^iM). 



Consider a form 7. We want to find the most general 7 that satisfies both of the fol- 
lowing!^ 

= (7, d^f3) = y 7 A ^d^f^ (4.3.62) 



Then, using (4.3.51 ) on both, we have 



djAi</3 = (4.3.63) 

So, in order for this to be satisfied for arbitrary a and /3, it must be the case that 

^7 = rft^ = (4.3.64) 

So, 7 must be both closed and coclosed. But we can make an important observation - if 7 
is an p-form, then ^7 is a p + 1 form and d'^j is a p — 1 form. Furthermore, both dd'^j and 
d'^dj will be p-forms. If 7 is closed and coclosed, then we can take 

{dU + dS)-i = {d + Sf-i = A7 = (4.3.65) 

^^We write them in opposite order with foresight - to make the result follow in a more obvious way. This 
is allowed because it is an inner product - (a, (3) = (/3, a) in general. 
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where 

A = {d + Sf = Sd + (4.3.66) 

is called the Laplacian operator]^ 

So, if 7 is both closed and coclosed, then A7 = 0. On the other hand, if we assume 
that 

A7 = (4.3.67) 

then it must be the case that 7 is both closed and coclosed. To see this assume the contrary, 
that it is closed but not coclosed. This means that ci^7 is not zero, but is equal to some other 
form, say d'^^ = uj. So, 

= A7 

= {d'^d + c?rf"^)7 
= dd'^^ 

= duj (4.3.68) 

Because this is not true in general we have a contradiction, and after using the same 
argument where we assume u is coclosed but not closed, we can conclude 

rf7 = rft^ = ^ A7 = (4.3.69) 



So we have traded the two constraints (4.3.641 for the single constraint (4.3.671. 
We denote any form that satisfies 

A7 = (4.3.70) 

a Harmonic Form. We denote the set of all harmonic ra-forms on M. as W^^AA) 

So finally, because we know that B]^{M) is orthogonal to B^^{M), and both are or- 
thogonal to H2^{M.), we have the decomposition 

A-(M) = B^iM) © B^tiM) © HUM) (4.3.71) 

This decomposition is called the Hodge Decomposition. It essentially says that just as 
any matrix can be decomposed into an antisymmetric part, a traceless symmetric part, 
and a pure trace, any form can be decomposed into an exact form, a coexact form, and a 
harmonic form. This is true of all forms on an arbitrary differentiable manifold. 



^^It is called the Laplacian because if you write it out in three dimensional Euclidian space with metric 
gij = it is simply 

/ q2 q2 q2 \ 

y'f{x,y,z)^^— + — + —y{x,y,z) 
So, A is simply the generalization of to an arbitrary n-dimensional manifold with an arbitrary metric. 
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4.3.5 A Bit More Topology 

A natural question to ask why 'why have we gone to the trouble of decomposing an 
arbitrary form into these three orthogonal parts?' (4.3.71 1. 



Consider a ri-form u; that is in the rf-Homology group if^(A^). From section 3.3 
know that this means that is closed: 



we 



du; = 

but there does not exist any n — 1 form ex such that 

u} = dcx 



(4.3.72) 



(4.3.73) 



or in other words, uj is not exact. Knowing from the previous section that we can decom- 
pose an arbitrary form u into the exterior derivative of an n — 1 form ck, the coexterior 
derivative of an n + 1 form (3, and a harmonic form 7, we write 

uj = dcx + Sp + -f (4.3.74) 

The fact that uj is closed means that 

duj = d^cx + dd^P + d'y = dS (3 = (4.3.75) 

But then, using the inner product ( 4.3.44[ > and the relationship (4.3.51 1, this implies 

= (0,/3) 
= {dd^(3,(3) 
= {d^f3,d^f3) 



which implies 

d^l3 = 

and therefore uj E H'^{Ai) is generally written as 

Lo = da + 7 



(4.3.76) 



(4.3.77) 



(4.3.78) 



Now consider some other element lj' E H2{M.). By definition this means that there 
exists some form 6 such that 



uj' = UJ + de = da + 'y + de = d{a + 6) + j = dif) + j 

where 

i/j = (x + 6 
Notice that the only difference between 

UJ = da + 7 



(4.3.79) 



(4.3.80) 



(4.3.81) 
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and 

u:' = dil^ + -f (4.3.82) 

is in the exact form. The harmonic form is the same in both cases. In other words, every 
representative of a given equivalence class in H'^(A4) has the same harmonic form, which 
means that there is only one harmonic form for a given equivalence class in H'^(Ai). 

What all of this allows us to say is that given any element lj E H2{M.) such that u 
decomposes as written above, we can choose 6 above to be = —a, so 

Lj' = 7 (4.3.83) 

This means that every equivalence class, or element, of H^iAi), has a single harmonic 
form which can represent it. So, 

H^{M)C HUM) (4.3.84) 

Looking at the definition of the n*'^ cohomology group, 

H-{M) = Z-iM)/B-{M) (4.3.85) 

Because 7 is harmonic as we have assumed, it is therefore also closed (and coclosed). 
Therefore 

7 e Z-,{M) (4.3.86) 

However, by definition 7 is not exact, and therefore 

7 ^ 5,"(-M) (4.3.87) 

And so, by the definition of H^iAi), this means that 7 is a non-trivial element of H^iAi), 
so 

Hl{M)(Z Hd{M) (4.3.88) 

Finally, comparing (4.3.841 to ( 4.3.88[ >, we have the remarkable result 

Hl{M) = H'J{M) (4.3.89) 

where the = should be understood as stating an isomorphism. What this result tells us is 
that there is exactly one harmonic n-form for every equivalence class of H^{Ai). 
Recall from section 3.2.10 that we had the string of equalities 

b''{M) = dim{H2{M)) = dim{H^{M)) = bn{M) (4.3.90) 

where b'^{Ai) = bn{Ai) where the Betti numbers and Hn{Ai) is the n homology group of 
Ai. We then showed that the Euler number could be written as 

00 

X{M) = J2i~^yb'{M) (4.3.91) 

i=0 
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We can now add to this string of equalities 



dim{Hl{M)) = dim{H^{M)) = (4.3.92) 

And therefore, 

oo 

xiM) = Y,h^ydtm{HliM)) (4.3.93) 

i=0 

Once again we see a remarkable equation - the left hand side is a purely topological quan- 
tity, while the right hand side is a purely analytic property. The left hand side has abso- 
lutely nothing to do with anything geometrical, whereas the right hand side can't even 
be defined without a specific metric! 



We have derived (4.3.93 ) for a variety of reasons, and as you might imagine it is a very 
simple and very specific case of a much more general idea. It will be useful in some of the 
physical applications we will consider later, but it is also to provide a simple illustration 
of much, much more profound and far reaching ideas. Later in this series we will devote 
a tremendous amount of time considering these types of relationships between global 
topological properties and local differential/ geometric properties. We will find that such 
analytic relationships provide a profound gateway between topology and geometry, and 
will be a primary tool in more advanced topics, especially string theory. 



4.4 Curvature 

4.4.1 First Intuitive Idea of Curvature 

We are finally able to talk about curvature, one of the most fundamental ideas in both 
geometry and, as we will see, physics. 

As a less technical and more intuitive introduction to this topic, we begin with a more 
qualitative discussion of how curvature is measured. To understand the meaning of both 
the Lie derivative and the torsion tensor, we started at a point p E M and moved to 
another point on Ai through two different paths. In general, the result of going around 
the two paths is different, and this difference is the quantity we are interested in. With the 
Lie derivative, the lack of closure of the relevant rectangle indicated the degree to which 
one vector field changes as you move along another. With the torsion tensor, the lack of 
closure of the relevant rectangle indicates the degree to which two geodesies twist relative 
to each other. 

Learning by example, let us consider a similar situation. However, instead of merely 
moving along two different paths, let us carry something along the two paths. Consider, 
as usual, M^. Let's take a random vector in M and, using the trivial connection (F^^^ = 0), 
let's parallel transport it along two different paths: first in the direction some distance 
6x, then the y direction some distance 6y, 
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Then we can do this in the opposite order. 




Notice that after both paths the vector is the same. In other words, the difference in the 
vector after being parallel transported through each path is 0. 

On the other hand, consider 5^ with an arbitrary tangent vector at the north pole. 
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We will define parallel transport such that the angle between the vector we are transport- 
ing and the tangent vector along the curve we are transporting along maintain the same 
angle. With that said, let's parallel transport along the left side of the diagram down to 
the equator: 




then transport this around to the front of the sphere: 




We can then transport down the right side, then to the front: 
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So, under these two paths the result is different. A moment's thought will make it clear 
that the reason for this difference between and S'^ is due to the fact that S"^ is curved, 
whereas is not. 

This is the approach we will take to study curvature. We will parallel transport an 
arbitrary vector at an arbitrary point on Ai around two different paths. We will take the 
difference in the resulting vector at the new point to be a measure of the curvature of ^A. 

4.4.2 The Riemann Tensor 



We now do mathematically exactly what we did in the previous section pictorially. 
As we noted above, we have found tremendous success in moving around rectangles. 
Previously, in studying the meaning of the Lie derivative and torsion tensor we simply 
used displacement vectors to move a point around the rectangle. Now we will move a 
vector around it. 

that as long as we assume torsion vanishes (F^j^, = Tip 



Recall from section 4.2.6 



parallel transporting a pomt along the infinitesimal vectors e then 6 will send you to 
same point as parallel transporting along S and then e. 

So consider a vector v at p G M, along with vertices atp,p + e,p + 6, and p + 6 + e, 



^^The calculation that follows, as well as our entire discussion of curvature, will depend on the assump- 
tion that there is no torsion. We will eventually generalize this, but for now torsionless manifolds are 
sufficient for our purposes. 
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Then we can parallel transport V to p + Se, gettin^^ 

v'\p + e) ^ v"\p + e + 6) = v'^ip + e) - r]^{p + e)6^v'\p + e) 

= v\p)-T],ip)e^v'ip) 

-(r;.,(p) + e'dir),{p))6^{v\p) - rl^{p)e-^v-{p)) 

(4.4.2) 

(where we have kept terms only to first order in e and 5, and we dropped the argument 
(p) in the last line): 

^''We don't have to both with parallel transporting S to the point p + e because we have assumed that 
there is no torsion on our manifold. We could redo all of this, carefully transporting 8 and e as well as v, 
but because there is no torsion it wouldn't change our answer. 
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We can then go in the opposite order, first getting 

v'ip) v"'\p + 5)= v\p) - V)^{p)5H\p) (4.4.3) 
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and finally getting 

v"'\p + 6) v'"" = v'- Ti,^6'v'' - Vyh^ + r;.;- r^„e^(5™t;" - dir.^b'e'v'' (4.4.4) 
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We want to know how much the vectors at p + S + e differ, and we therefore take the 



difference between (4.4.21 and (4.4.41: 



(4.4.5) 



where we switched around the summed dummy indices to get the second to last line. The 
tensor 



K 



■jki 



Pi >5 _L V"^ F' F 



kj 



km Ij 



Im kj 



(4.4.6) 



is called the Riemann Tensor, and it is a measure of how different a vector is after it has 
been moved around two different paths. Or, based on our considerations in the previous 
section, it is a measure of the curvature of the manifold: 
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Before moving on, we briefly consider the symmetries of the Riemann tensor. No- 
tice that in the picture above we denoted the Riemann tensor as -R'(v, e,<5) instead of 
*e*^(5'. In doing this, we are merely emphasizing the fact that the Riemann tensor has 
three covariant indices and one contracovariant index, which we can take to mean that it 
is an object that maps three vectors to one vector: 

R%i : Tp{M) ® Tp{M) ® T^{M) Tp{M) (4.4.7) 

Specifically, R'iy, e, S) = Rj^iV^e'^SK 

From this we can plainly see several symmetries and antisymmetries. The most ob- 
vious is between the e and S vectors - if we had switched the order the -R*(v, e, 5) vector 
would point in the opposite direction. We should therefore suspect that i?*^^ is antisym- 



metric in the last two indices. Looking at (4.4.6 1, we see that this is indeed the case. So, 



R^ki = -R]ik (4.4.8) 

Another useful identity is to use the metric to lower the first index of the Riemann 
tensor: 

Rijki = g^nR]kl (4.4.9) 
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The symmetries of this new tensoip^are 



Riklm R-ikml Rkilm 

Riklm Rlmik 

Riklm + Rilmk + Rimkl = (4.4.10) 



Finally, if we work in normal coordinates (cf section 4.2.9 1 we can show that 

R]ki = 9kT]j - diTlj 

R.,u = l[dA9u-dA9,-dA9^k + dA9k,] (4.4.11) 

Before moving on, we illustrate the meaning of the Riemann tensor one other way. If 
we start with a vector v at p and parallel transport to point p + e we can think of this as a 
Taylor expansion, but instead of using the normal partial derivative we use the covariant 
derivative: 

v{p) — > v(p + e) = ^{p) + ^e-v{p) + ■■■ (4.4.12) 

Then dragging this along to p + e + 5, 

v{p + e) — > v{p + e + 6) 

= v(p) + Vs^rip) + V,v(2)) + VsVe^r{p) + ■■■ (4.4.13) 

On the other hand, had we done S then e, we would have 

y{p) — >Y{p + 6 + e) = y{p) + Ve^r{p) + Vs^ip) + V^Vs^ip) + ■■■ (4.4.14) 

So the difference in these two vectors (to first order in S and e) 

{^(p) + ^s^ip) + ^e^ip) + V5V,v(2))) - (v(;^) + VMp) + ^s^{p) + ^e^s^ip)) 

= [Vs,VMp) (4.4.15) 



^^Don't worry about the physical meaning of this tensor for now. We are merely quoting these results for 
completeness. 
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If we write this expression out (in components) 



d_ 
dx^ 



_d_ 
dx' 



d f dv 



dx^ \ dx 
d f dv 



, + 1 -rf,( 



9x* V dx^ 



dx' 



which is exactly what we had above in (4.4.6). So we can tentatively say that 

4,5VV = [V,,V.]^'= 
or in the other notation mentioned above, 

R\^r,6,e) = [Vs,V,]v' 



(4.4.16) 



(4.4.17) 



(4.4.18) 



We included the word "tentatively" above because there is one major assumption we 
made in the derivation of ( 4.4.16[ > - namely that the relevant rectangle is actually closed. 
The point is that we are comparing the vector v after going to the same point around two 
different loops. But if you don't arrive at the same point after going around those two 
loops then the comparison of the resulting v is meaningless. If, for example, 

Cse ^ (4.4.19) 

then this won't work because the resulting v's will be at two different points (cf pictures 
on pages 243 and 78 1. Fixing this problem is very easy - if the Lie derivative is non-zero, 
then the locations of the parallel transported v's after each loop will be separated by a 
distance Cs^, and therefore we can parallel transport one of the resulting v's to line up 
with the other by simply using the same Taylor expansion we used to "move" the v's to 
get ( 4.4.16[ >. In other words, the corrected expression for the Riemann tensor is 



/?'fv,(5,el 



([V5,Ve] 



(4.4.20) 



We can take this to be the general definition of the Riemann tensor. In fact, this is a general 
expression for any curvature tensor, whether it comes from the Levi-Civita connection or 
not. 
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4.4.3 Second Intuitive Idea of Curvature 

The Riemann tensor is the first, and in many ways the most fundamental, notion of 
curvature we will work with. However there is another way to quantify curvature that, 
while related to the Riemann tensor, has a unique interpretation. 

Consider two particles in at points p and q with Euclidian metric gij = Sij, initially 
separated by a distance a, each moving parallel to each other along the x axis towards the 
origin: 



If we take the geodesies in with g^j = Sij through p and q in the direction of these 
vectors, parallel transporting the vectors along those geodesies (respectively) will give, at 
a later time. 
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q 



They are still separated by the distance a. 

Now consider a similar situation but instead of we work with S*^ with metric 
( |4.1.62 >. Start with p and q at the "equator" and point both vectors towards the "north 
pole". 
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If we parallel transport these in a similar way, we end up wit 




Notice that in this case the vectors, which started out parallel and separated by a distance 
a, are now at the same point. 

This property, of vectors starting out parallel to each other, each moving along parallel 
lines, but still intersecting, is a direct consequence of the Riemann tensor. The vector 
initially at p can be parallel transported along as drawn in the above pictures. On the 
other hand, it can be parallel transported along a to the point q, resulting in vector q, 
which is then parallel transported to the new point. This is equivalent to what we did 
to derive the Riemann tensor, only instead of parallel transporting the vector along two 
paths of a rectangle, here we have parallel transported it along two paths of a triangle. 
The result is the same - the degree to which they are no longer parallel is a measurement of 
curvature, and is quantified by the Riemann tensor. In this sense, we have done nothing 
new. 

However, we can approach this a different way. Looking again at M?, consider a vol- 
ume V located between p and 

^^Please forgive the sloppy artwork. 

^^Oi course, because we are in 2 dimensions, a "volume" is an area. We are saying "volume" because we 
will eventually generalize this to an arbitrary manifold. 
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With a metric this becomes straightforward to write the volume of this - assuming p and q 
are still separated by a distance a, the volume (area) will be (assuming it is a circle, which 

2 

is a reasonable assumption at this point) Now choose any point in the volume and 
choose a vector at that point. We can parallel transport this vector to every other point in 
the volume to define a class of vectors - one for each point in V. 
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If we parallel transport each of these points along their respective vector, we end up with, 
at a later time. 




the volume will be the same. 
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On the other hand, if we had done the same thing on S'^ as before. 




we find (from this admittedly crude graphical approach) that the volume changes as each 
point of V moves along its geodesic. 

Of course, we could have started with a volume near the north pole and parallel trans- 
ported each point in V down towards the equator, and the result would have been V 
increasing in size. 

So on the flat the volume did not change, whereas on the curved S*^ the volume did 
change. It turns out that this property, volume deviation, is yet another quantitative mea- 
surement of curvature. The specific quantity by which the volume changes is measured 
by what is called the Ricci Tensor. As you might imagine, it is intimately related to the 
Riemann tensor. However, in the next section we will derive this volume deviation term 
independently and then show that/how it is related to the Riemann tensor. 

4.4.4 Ricci Tensor 

We want to make what we said in the previous section concrete. We start with a collec- 
tion of particles clumped together near each other in some part of a manifold Ai. At the 
beginning, their relative positions are fixed. Furthermore they are all moving in the same 
direction initially (each velocity vector is initially parallel to every other velocity vector), 
and therefore the initial velocity between any two particles is initially zero. However, be- 
cause we know from the previous section that as this collection of particles moves along 
a collection of geodesies the relative positions of any two of them may change, the second 
derivative of position, or the relative acceleration, between any two points may be non- 
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zero initially. We therefore want to find the acceleration between two arbitrary points. To 
do this we will follow a calculation similar to what we did in section 14.4.21 

Let's repeat the general argument that led to (4.4.51 with a few slight changes. We 
begin with a vector v at p G Ai. We will interpret this as a velocity vector, and we can 
then take the geodesic through p in the direction of v: 




P 



We can parallel transport v along the geodesic defined by v for a small time Ae (where Ae 
is now an infinitesimal scalar, not an infinitesimal vector). In other words, we are parallel 
transporting v a distance Aev: 
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P 



(Notice that we are doing essentially the same calculation that led to (4.4.51 except the 
vector we are parallel transporting is in the same direction as our geodesic). 

We can also take another point q near p, separated by a very small vector Aeu (again 
Ae is an infinitesimal scalar and u is a finite vector), parallel transport v to g so that the 
vector at p and the vector at q are parallel, and then let the vector at q follow its geodesic 
for a time Ae, 
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Our interest (at this point) is the relative velocities of the particles after the time Ae. We 
can find this by taking the difference in velocity at p' and q' , which we find by parallel 
transporting the velocity at p' to q': 
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Comparing this situation to ( 4.4.5[ > we have that Av is given by 



(4.4.21) 



As stated above, we are interested in the relative acceleration of these particles. Ac- 
celeration is simply the change in velocity over change in time, so we can define the 
acceleration vector a as 



lim — — 
Ae^O Ae 



(4.4.22) 



and therefore (4.4.21 1 is 



a 

lim -— 

Ae^o Ae 



(4.4.23) 

This is the equation of geodesic deviation. What it says is nothing really new after section 



4.4.2 If the Riemann tensor vanishes, two objects traveling along geodesies don't accel- 
erate relative to each other|^ If the Riemann tensor is non-zero then they will accelerate 
relative to each other. This relative acceleration is a measurement of how curved a space 
is. 



Now we attempt to recreate what we did in the second half of section 4.4.3 with the 
small volumes. Imagine a small ball of initial radius Rq along every direction. If we wait 
a time Ae (as above), the new radius along the i^^ dimension will b^^ 



r*(Ae) = Ro + -a^Ae)' 



(4.4.24) 



So, from this we can easily calculate 



r^(Ae) 
P(Ae) 



a*Ae 

a' 



(4.4.25) 



where the dot represents a derivative with respect to time Ae. This gives us the relation- 
ships 



r\Ae] 
r\Ae] 
r\Ae] 



Ae=0 



Ae=0 



Ae=0 



Ro 


a' 



(4.4.26) 



''^This is a physical interpretation of the Euclidian postulate that parallel lines don't intersect. We are 
doing non-Euclidian geometry where parallel lines can intersect. 



''^This is nothing but the standard "physics I" expression 



Ax = .To + vot - 



1 



where vq is zero because the particles start off along parallel geodesies and a' is given in equation 1 4.4.22 
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So 

f*(Ae) a* 
lim — 



Ae->o \ Ro J Ae 



Furthermore, because r* is the radius between the starting points, we can without loss of 
generality take u to be a unit vector in the i^^ direction. 

We can get rid of the limit on the right hand side by taking Rq = Ae, giving 

lim ^^^^ = Rii^v'^v' (4.4.28) 

Where we have taken u to be a unit vector in the z*'^ direction, and no summation is 
implied by the repeated i on either side. 

So this volume started off an n-dimensional sphere of radius Rq. This will have vol- 

n 

(you can check this formula by simply doing the integrals, or by checking any introduc- 
tory text on basic geometry), where T(n) is the Euler gamma function. 

In our case, however, the radius along each dimension is changing, and therefore in- 
stead of an n-dimensional sphere with radius Rq, we have an n-dimensional ellipsoid 
with radii r*(Ae) for every i. The volume of this ellipsoid is 

n d 

K = ^;^^n^^(^^) (4.4.30) 



1=1 



where T(n) is the Euler gamma function (you can again check this expression in any intro 
geometry text). Then we can take a derivative with respect to Ae, getting (leaving out the 
Ae dependence for notational simplicity) 

ri d d j 

^^.-flf^En^^- (4.4.31) 

^2 ' ' i=l j=l 



Then 



a / .... i—1 . ,. . i. \ a 

^"i)E ^-2^^ n^'- (4.4.32) 

' ,'—1 \ I — 1 / ^'—1 



33 



Notice that this reduces to the familiar 7r(i?o)^ for a circle and |7r(i?o)^ for the volume of 
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So, 



Vn 



1=1 



fc=l 



(4.4.33) 



Taking the Ae — limit and plugging in (4.4.261, 



lim 



lim — 



4 = 1 



(4.4.34) 



and then plugging in ( 4.4.28[ >, 



j=l i=l 



(4.4.35) 



where we have invoked the normal summation convention in the second to last equality, 
and in the final line we have defined the rank 2 tensor 



Rik = R 



lik 



(4.4.36) 



This tensor is called the Ricci Tensor, and as promised in the last section it tells us the 
quantity by which the volume is changing. And, as we indicated in the last section, it is 
clearly directly related to the Riemann tensor. Specifically, the meaning is that given some 
volume of particles forming a volume 



(4.4.37) 



Ae^O 



if every point of the volume is moved along an initially parallel set of geodesies (all par- 
allel to v), the volume will remain unchanged in a flat space where i?*;;. = 0: 

Vnfi = (4.4.38) 

The first derivative of Vn will always be zero because every point starts off along parallel 
geodesies and therefore no two points are initially moving relative to each other. How- 
ever, they are initially accelerating relative to each other, and therefore in a space where 
Rij is not zero. 



Vnfi = {RijVV )Vn,0 

Again, we will postpone examples until later. 
4.4.5 Third Intuitive Idea of Curvature 



(4.4.39) 



Alice and Bob are standing on a two dimensional manifold, but they don't know any- 
thing about the manifold. Alice suggests an experiment - she happens to have a rope of 
length L. She holds one end and tells Bob to take the other and walk away from her until 
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the rope is pulled tight. Then, she says, her plan is for her to stand still while Bob walks 
in a big circle, keeping the rope pulled tight, therefore staying the same distance from her 
on the manifold. She also tells Bob to keep track of exactly how far he walks before he 
comes back to the same point. Because the rope has length L, they predict that he will 
walk 271 L units before returning to where he started. 

However, after doing this experiment Bob finds that surprisingly he has actually trav- 
elled 4L units! Returning to talk things over with Alice, they realize what is going on. 
They conclude that they must be standing on a sphere with radius — . The circumference 



(at the widest) point of such a sphere would be 27r^ = 4L, the exact distance he actually 
walked. Because L is equal to one quarter of the circumference of the sphere, when Bob 
pulled the rope tight, he had actually walked a quarter of the way around. If Alice had 
been standing on the "top" of the sphere. Bob was standing on the equator. 



The reason the circumference of the circle Bob walked around was not what they ex- 
pected from their "flat space" intuition is because the surface was actually curved. In 
other words, another measure of curvature is the deviation of the volume of the bound- 
ary of a space from what you would expect in flat space. 

More generally, if we take a point p on an n dimensional manifold M. and then take 
the collection of all points a particular distance away, that subspace will have an n — 1 
dimensional volume. One could calculate what they would guess that volume to be if 
they were in flat space. But if the space is curved there will be some deviation. The 
specific measure of curvature we will be interested in is the lowest order correction to the 
n — 1 volume. We will make this more precise in the next section. 

4.4.6 The Ricci Scalar 

To make the idea of the previous section precise, we begin with some point p E M. (we 
will be assuming M. has dimension n). We will define an n — 1 volume by taking every 



Alice 
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point some distance A away from p: 




This will define an n — 1 subspace homeomorphic to S"'~^ C Ai (we will denote this 
subspace S*""^ for the rest of this section). Because we will approach this problem by 
defining everything in terms of an enclosure of p, maintaining generality will demand 
that we only use terms evaluated at p. This will make our calculation quite difficult and 
we will therefore proceed slowly. 

To begin with, we will define coordinates on S"'~^, which we denote 9^ (for i = 
1, . . . , n — 1). We map a given point to Ai with the coordinate functions on A^: 

6 — > x\e) (4.4.40) 

The collection of all points x(0) for all 6 is then the set of all points of S^^^. 

Our ultimate interest is in the n — \ volume of 5"^, which we find by integrating over 
the invariant n — 1 volume form (cf section 4.3.2p 



ns^-i = ^A^\ de^ Ad9^A---A de""-^ (4.4.41) 

where (jij is the metric on 5'"~^p^ 

So, working from p e M, we want to find gij at all points p + \ (along all possible 
geodesies through p) in all directions. We assume that we have the metric gab on M, 

^''With the appropriate handling of multiple coordinate patches with partitions of unity, etc. - cf section 



2.3.8 
— 35 



For this section we will primarily use a,b,c, . . . for coordinates in M and i, j, k,. . . for coordinates on 
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and we can therefore use the pullback from x^{0^) to find the induced metric gij on S^~^. 
However, as we said above, we must be careful because we want to define everything in 
terms of what we know at p, which will demand some work because clearly p ^ S*""^. 

For an arbitrary point on 5"^^ (at p + A), we have the general pullback given by (cf 
section IZZSl 



dx''{p + X)dx\p + X) 



9ab{P + A) 



(4.4.42) 



We must therefore expand all three term on the right hand side around p. 

Starting with the derivative terms on the right, the exact coordinates of x"'{p + A) will 
be given by 



x\p + \) =x\p) + \ 



dx"'{p) 1 2 



+ 7^^ 



9A2 



9A3 



(4.4.43) 



This will be valid to arbitrary A, but we can simplify it by assuming that A parameterizes 
a geodesic. In this case we have (cf equation ( |4.2.69 >) 



Q2^a 



^„ dx^ dx^ 



dx^dh^ 



dVl^ dx^ dx'^ 



dX dX dX 

^ dx^ d'^x^ dx'^ dx^ dx^ 



dX dX^ 



dX dX dX 



(4.4.44) 



where da 



■7^. So (all terms are evaluated at p so we drop the arguments) 



x^{p + A) = + A 



dx"" X^ „ dx^ dx^ 



dX 2^^' dX dX 6 dX dX dX 



Q d Q b Q C Q^b Q . 



be 



dX dX 



+ ■■■ 



Av( 



(4.4.45) 
, where v is some 



Then if we use normal coordinates at p we can take (at p) x 
vector at p in the direction of a particular 6 G 5*"^^ So, from section 4.2.9 we have 



Q2^a 
be 

x\p) 
dx"" 



So finally. 



dX 










(4.4.46) 



x''{p + X) 



Aw" 
Aw" 



o 



(4.4.47) 
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where have symmetrized the sum in the last term. 



So, finally, the partial derivative terms in (4.4.42 1 are given by 



09^ 



(4.4.48) 



where the product rule with the partial derivative on the three v's has resulted in the 
factor of 3. 

The metric term in (4.4.42) is a bit easier to expand: 



9ab{p + A) = gabiP) + A + -A + 



dx^ 1 

9ab + X^deQab + ^"^^ 
OA 2 



dx'^ 



dd9. 



ab 



9ab + ^>^v%'^d^ddgab 



(4.4.49) 



where we have used (4.2.143 1. 

So we can finally piece together (4.4.421 to form gij. Multiplying this out and keeping 
terms to only fourth order in A (we will switch dummy indices around quite a bit in what 
follows). 



dx''(p + X)dx^(p + X) , 

9ij = 9ab{P + X) 



X 

X' 
X' 

X' 

X' 

X' 



89^ 

dv°- X^ 



39' 




Qya 


8v^ 


89' 


89^ 


dv" 


8v^ 


09' 


89j 




8v^ 


89' 


89i 


dv" 


8v^ 


89^ 


89i 




8v^ 


89' 


89^ 



6 



89^ 



,8v^ 

W' 



\ 8v^ X^ ^ . f „8v^ 



\2 

Qab + -:^v^v^8k8igab 



A^ 8v°- ^ ^i, f „ 8v^ X'^ ^ ^„ J ^ 8v'^ 8v^ 



^4 Qya Qyb 



9^b- -^-^U(fl gh) 

8v'^ 8v^ 



9ab + 



9ab + 



2 
2 
2 

A^ 



89^ 89^ 

Qye Qyb 
WW 



rv%''8c8dgeb - ^8^c^''^^)V^v'^gab 



J gab 
+ 



v^v^8k8igab 



2 89' 893 
8v^ 8v'^ 8v^ 8v 



89' 89^ 89'89i 



""1 „.c„.d. 



(Qye Qyb>. 



2 \ 89' ' 89j j 



'dgeb 



J 8v^ 


8v^y 


XW'' 


W ] 




( 8v^ 


"Vgab 





J gab 



j 8v^ 


8v^) 


[w 


W J 



8c8dgeb - -8(cTae)gab 



(4.4.50) 



where the curly brackets are, as usual, the anticommutatoij^ 

Looking at the term in brackets in the last line (and remembering that in normal coor- 
dinates 8agbc = 0), 



8c8dgeb - ■^di^J"^^)gab 



8c8dgeb - 7^i8c8egbd - 8c8dgbe + 8c8bged) 



■ecdb 



(4.4.51) 



36, 



So {A, B} = AB + B A. 
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where the last line is the Riemann tensor in normal coordinates (cf equation (4.4.11 )). 
So now we can write 



9ij 



ecdb 



(4.4.52) 



With foresight, let's look more closely at the original metric gab on A^. Consider using, 
instead of the coordinates on ^A, generalized spherical coordinates with radial direc- 
tion A and angular coordinates 9\ Denote this metric on 7V1 Gab {Gab will be an n x 
matrix). We can then do a normal coordinate transformation to find (if i = 1 is the radial 
direction) 



Gil 
Gic 



gab = v'^v'^gab = 1 



A d 



cd 



dX dX 

Q^a Q^b ^ Q^b 

Q^a Q^b Q^a Q^b 

l9ab = X -7:rj-^gab = X h. 



V V gab 



X_d_ 
206^ 







cd 



(4.4.53) 



QQc QQd^'^" QQc QQd^ 

where c 7^ 1 in the second line and c, (i 7^ 1 in the third. So Gab is the block diagonal matrix 

\ 



Gab- 



X^h 



ab 



and therefore 



G 



ab • 



\ 



/I 




(4.4.54) 



\ 



1 uab 



\ 



(4.4.55) 



We can then transform back to the coordinates 

Q^a Q^b 



9 



ab 



dX dX 89' 89^ 

,8v"' 8v^ 1 ■■ 



89' 89^ A2 
„ u 8v°' 8v^ ■ . 



89' 89^ 



This then implies 



, ,,8v"' 8v^ 

n ■' 

89' 89^ 



gab _ ^a^b 



(4.4.56) 



(4.4.57) 
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So now. 



2\2 



6 \de'' dei] 



■ecdb 



(4.4.58) 



To find the determinant g, we start with the general relationship that can be found in 
any introductory linear algebra text. For matrices X and A, 



det{A + eX) = det(A)(l + eTr{A-^X)) 



(4.4.59) 



Using this we can easily compute the determinant of ( 4.4.58[ >, getting (the determinant of 

hij is h) 



g = hi 1 H v^v 



j dv^ dv'' 1 

6 ^ " [Wde^j 



ecdb 



(4.4.60) 



We can simplify this using (4.4.571: 

9 



■ecdb 



h[l + -v%''ig'''-v%')Recdb 
h[l + ^v^'^g^'Recdb - ^v%''v%''R,,db 



(4.4.61) 



The last term involves a symmetric sum over Recdb, which is antisymmetric in the first 
two and last two indices (cf ( 4.4.10| >) and it therefore vanishes. So 

~9 = hfl + ^v^^g^'R,,,, 



\2 

hll- —v^v'^R^ 

\2 

h{l- —v'v'^R. 



cbd 



cd 



(4.4.62) 



So then, finally, the n — \ dimensional volume form is 



\g\de^ A---Ad9'' 



n-l 



Vh^i - —v^v^Rcd de^ A---A de''-^ 
Vh(^- ^v%'^R^^ de^ A---A de"-^ 



\2 

= Vhde^ A---A Vhv'v^Rcd de^ A---A d& 

6 

So finally, we can find the n — 1 volume in the curved space, V^: 

A2 



in— 1 



(4.4.63) 



V, 



VhdO^ ■ ■ ■ de""-^ - ^Rcd j y/hv%'^de^ ■ ■ ■ de""-^ (4.4.64) 
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The first term has no dependence on the Riemann or Ricci tensors - in other words it 
represents the volume contribution in flat space, Vj. In the second term we are integrating 
the product of v over all angles. But from the first equation in ( 4.4.53[ > we know that v is 



invariant under rotations. Therefore the integral over them must be simply proportional 
to the metric 

j Vhv^'^de^ ■ ■ ■ dO""-^ oc g"^ (4.4.65) 
Inserting a proportionality constant and contracting both sides with the metric: 

gcd j ^Thv^^de^ ■ ■ ■ dr-i = ag.dg''' 



h{i)de^---de 



= an 



Vf = an 

a = ^ (4.4.66) 

n 



With all of this, (4.4.641 can be simplified as 



V, = Vf- ^Rcdg'^a = Vf-^^R = Vf(l- ^r) (4.4.67) 
6 b n \ bn J 

where 

R = Rabg''" (4.4.68) 

is called the Ricci Scalar. Its meaning is that it is the lowest order correction to a volume 
compared to what would be expected in flat space. 

4.4.7 Fourth Intuitive Idea of Curvature 



The technical derivation of the final curvature tensor we are interested in is much 
more self-explanatory than the previous three. We therefore keep our comments here 
brief. Imagine that you are interested in the curvature of a manifold, but not the entire 
manifold. Rather, you are interested in "slices" of it. For example, imagine a tube: 

^^This is a standard trick in differential geometry - the metric plays the role of the identity in tensor 
equations. 
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(imagine it going off in both directions forever). You may have information about the 
curvature across the whole thing, but let's say that you don't want that much information. 
Instead you only want information about the curvature at a single slice: 




You can specify this slice by choosing a unit vector at a point and taking the slice orthog- 
onal to it: 
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So, this final notion of curvature doesn't provide a new way of measuring curvature, 
but rather introduces the notion of "sectional curvature". 

4.4.8 The Einstein Tensor 

Consider a point p E M. We will start with the Riemann tensor Rabcd and contract to 
get the Ricci scalar. However we aren't interested in the Ricci scalar on all of M.. Instead 
we are only interested in the curvature of the subspace of M. that is orthogonal to some 
V G TpfA (of unit length, gabv"'v^ = 1). So whereas normally we would get the Ricci scalar 
from Rabcd by contracting it as 

g^'g^'Rabcd = R (4.4.69) 

that won't work here. Rather, we must project off from each of these contractions the 
component that is parallel to v. Therefore we replace g""^ with g'^^ — v"'v'^ in the above 
contraction. We will denote this "directional" curvature value —2Gabv"'v^^\ 

^g-c_^a^c^^gbd_^b^d^j^^^^ 

g'^'g'" Rabcd - g'^'v'v'' Rated - g'^V^'Rabcd + V%%\^Ratcd 

R - 2g'"'v'v'' Rabcd 
R - 2vVRbd 
Rv\^gM - 2v\^Rm 
{RgM - 2Rm)vV 

Rbd - \gbdR (4.4.70) 

''^Don't worry about the factor of —2 - it's not important at this point and we merely include it for agree- 
ment with what will come later You can think of it as an irrelevant scale factor for now. 



b„.d 



- 2Gmv'v 



G 



bd 
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This new tensor. 

Gab = Rab - loabR (4.4.71) 

is called the Einstein Tensor. So, given an n-dimensional manifold with Ricci scalar 
and a vector v G TpM for some p e M, the n — 1 dimensional subspace of M. orthogonal 
to V will have curvature 

i?(n-l) = GabV^' (4.4.72) 

This notion of "sectional curvature" will play a very important role in general relativ- 
ity. Specifically the vectors v that we will use to define an orthogonal subspace will be 
timelike vectors. Given a particular unit timelike vector, the subspace orthogonal to it will 
be the purely spatial region for a given instant in time. General relativity will say that the 
curvature of a spatial "slice" of spacetime, given by the Einstein tensor, is proportional to 
the energy density in the spacetime. We will discuss this in more detail later. 



4.5 Examples of Curvature Tensors 

We will now go through the manifolds we have been considering, that we know the 
metrics for, and compute each curvature tensor for them. We summarize our results so 
far, starting with the metric, in the following table: 



Name 


Tensor 


Definition 


Metric 


9ij 




Connection 


p. 


y'^djgki + dkgij - digjk) 


Riemann Tensor 




l-'fc-L ij OIL f^j -t- i ^^i ij i 


Ricci Tensor 


Rij 


TDk 

^ikj 


Ricci Scalar 


R 


g'^Rij 



We have found the metrics and connections for several manifolds already (cf sections 



4.1.4 and 4.2.71. We will now go through each of those manifolds and compute the curva- 



ture tensor for each. 
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4.5.1 



We found the metric on the circle by embedding it in 

X — r cos 9 

y = rsm9 (4.5.1) 
and using the puUback. We previously found that the metric for the circle is 

= r^dO^ (4.5.2) 

or 

gij = gee = (4.5.3) 

As shown above this lead to 

4 = yt,j,k (4.5.4) 

And therefore all of the curvature tensor vanish. Therefore, the circle has no curvature. 

This result may seem surprising. We will therefore discuss what it means for a mani- 
fold to be "flat" in the next section. To get an initial intuitive feeling for why it is flat, recall 
how we derived the Riemarm tensor. We parallel transported a vector around two paths 
to the same point and took the difference. If you imagine doing this on a circle, it is clear 
that the only two paths from point p to point q are the two directions around the circle, 
and obviously in both cases the vector will be the same in the end. We will discuss this 
notion of flatness (and others) in section ??. 

4.5.2 Stretched Out 



The next manifold we considered was the "stretched out" circle or the ellipse, which 
we mapped using 

X — 2r cos 9 

y — rsm9 (4.5.5) 

Even though we pointed out that this is actually identical to the circle (up to coordinate 
transformation) we will still consider it separately to show this equivalence explicitly. The 
metric was 

or 

Qij = gee = r^A cos^ 9 + sin^ 9) (4.5.7) 

We found for this that the connection coefficient is 

, 3sin(2g) 

^ -5 + 4cos(2^) ^^-^-^^ 
It is then straightforward to show that 

R%i = Vz,j,fc,; (4.5.9) 
So all of the curvature tensors vanish and we see that the ellipse also has no curvature. 
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4.5.3 



The next manifold we considered was the sphere, which we mapped using 

X — sin 9 cos 
y — sin 9 sin 

z = cos^ (4.5.10) 



The metric was 



ds^ = d9^ + sin^ 9d(l)^ (4.5.11) 



9ij^[l ^2.] (4.5.12) 



or 

" ^^^0 sin^^^ 

We then found that the non-zero connection coefficients were 

rj^ = -sin^cos^ 

4^ = rJ, = cot^ (4.5.13) 

From these we can compute the Riemann tensor, which has non-vanishing compo- 
nents 

Ko4> = -Rle = ^^^'0 (4.5.14) 

It is good that we found that i?* ^.^ is non-vanishing for the sphere - it was the example we 
used to define K^jf^i, so if it was zero we'd be in trouble! 
Next, the Ricci tensor has non- vanishing components 

Ree — 1 

R4,4, = sin^e (4.5.15) 

And finally the Ricci scalar is 

R = + /*i^e<^ + + 

= 1 + sin^ 9 

sin^6' 

= 2 (4.5.16) 

So the sphere has constant curvature. This isn't surprising - every point on S"^ is "sloped" 
the same as any other point. There is no variation in how curved it is from point to point. 
Before moving on, notice that 
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The fact that it is proportional to (in fact, equal) the metric is not surprising. On any 
manifold of constant curvature R, 

R = g^-'Rij = const =^ Rij oc gij 

^ R = g'^Rij = g'^agij = an (4.5.18) 

where a is the proportionality constant and n is the dimension of the manifold. Specifi- 
cally, 

a = - (4.5.19) 

n 

So, we know that here R = 2 and n = 2, so a = | = 1, and therefore 

Rij = agij = gij (4.5.20) 

we we found above. 

As is clear from (4.5.10 1, we choose this sphere to have radius 1. You can go back and 
convince yourself that had we taken the radius to be r, we would have had 

R=^ (4.5.21) 



r 



So the larger the radius the smaller the curvature. This makes sense. On a small sphere 
the curvature would be easy to notice. On a larger sphere (like the Earth), the surface 
looks flat (i.e. less curved). 

Finally, we can compute the Einstein tensor: 

^ij '^ij T^Sij^ 





° ] 








I 


sin^ 


2 


I 


sin^ 9 i 



= (4.5.22) 

This makes sense - for a given vector somewhere on S^, the subspace orthogonal will be 
a circle S^, which we know from above is flat. 

4.5.4 52 Stretched Out 

The stretched out S'^ given by 

X = sin 6 cos (p 
y = sin 6 sin 

z = XcosO (4.5.23) 
is more interesting. We found above that the metric is 

ds^ = 1((A2 + 1) - (A^ - 1) cos(2^))ci^2 ^ gi^2 g^^2 (4.5.24) 
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or 



9 if- 



A((A^ + l)-(f-l)-(20)) \ 
\ siir^y 



We also found the non-zero connection coefficients to be 

(A2 - 1) sin(2^) 



r 



(A2 + 1) - (A2 - 1) cos(20) 
2 cos 6 sin 6 



(A2 + l)-(A2-l)cos(2^) 
= rJ, = cot^^ (4.5.26) 

Then (with either a lot of tedious work or a computer), it is straightforward to calculate 
the non-vanishing components of Riemann: 



(A2 + 1) - (A2 - l)cos(2e) 



- ((A2 + i)_(p_i)eos(2^))^ ^ ^ ^ 



The Ricci tensor has non-vanishing components 



(A2 + 1) - (A2 - 1) cos(2^) 



4A2sin2^ 

R^^ = ^2 4.5.28 

(A2 + l)-(A2-l)cos(2^))' 



And finally, the Ricci curvature scalar is 

8A2 



((A2 + l)-(A2-l)cos(2^))' 



(4.5.29) 



To try to give a bit more intuition about what the curvature scalar represents, let's look 
at it more closely for various values of A|^ First, notice that all three tensors (Riemann, 
Ricci, and the scalar) all reduce to their values for the un-stretched ^2 in the previous 
section for A = 1. Next, notice that the scalar has no dependence. This is because the 
distortion due to A is entirely in the z direction, and because is the azimuthal angle it is 



invariant under distortions along the z axis. From the pictures in section 4.1.4 it is clear 
that for any fixed 9 all points are symmetric around the z axis. 
We can graph the curvature for various values of A and 9: 



^^It will be very helpful to read over the section on this manifold in section 4.1.4 especially the pictures 
for various A. 
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We have taken A (the axis on the bottom left) to run from to 2, and 9 (the bottom right 
axis) to run from to vr. Notice that at A = 1 the curvature is constant (at 2) as expected 
(this is the standard sphere). For A = 2, the sphere is elongated along the z axis (cf picture 
of egg on page 175[ >. As expected the curvature takes the greatest values at 6* = and 
9 = Tc, and the least value at ^ = |. This is exactly what we would expect - the curvature 
is greatest in the ±z directions and least at 2; = 0. 

For A — s> (cf picture of pancake on page 175 1, the curvature is very small (approaching 
0) for 9 = through close to 6* = |, which is what we would expect. For A = 0, the 
curvature will become infinite at 6* = | and zero elsewhere. 

Finally we can form the Einstein tensor. But before doing so, let's think about what 
we'd expect it to be. With the sphere we pointed out that a cross section is S^. And, as we 
saw above, is always flat. So even on this deformed sphere we would expect that the 
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Einstein tensor vanishes. So, testing our intuition: 



Gij Rij ^dij^ 



2A2 







(A2+1)-{A2-1) cos 26* 

g 4A2 sin^ 9 



((A2+l)-{A2-l)cos(26»))" 

1 fl ((A2 + 1) - (A2 - 1) cos(2^) \ 8A2 

sin^ 9 J ((A2 + 1) _ (A2 _ 1) cos(2^)) ' 



2A^ 



(A2+l)-(A2-l)cos26» 



Q 4A2 sin2 d 



((A2+l)-(A2-l)cos(26())' 




(A2 + l)-(A2-l)cos26» 

g 4A2 sin2 d 



((A2+l)-(A2-l)cos(26l))' 

= (4.5.30) 
4.5.5 Torus 

Finally, we considered the torus with map 

X — {R + r cos 9) cos 
X — {R + r cos 9) sin (f) 

z = rsin^ (4.5.31) 

This gave metric 

„2j/)2 I /d I ^„^„/)\2 7/2 



or 



ds' = r'd^' + (i? + r cos 9yd(l)' (4.5.32) 



. /r^ \ 

^'^■"1^0 {R + rcos9)y ^^-^-^^^ 

This metric gave us the non-vanishing components of the connection: 

{R + r cos 9) sin 9 

r-^ _ r"^ - ^^^^^ r4 5 34) 

This connection will then give 

Ri,i^O Vi,j,k,l (4.5.35) 

and therefore all curvature tensors vanish. Thus is a flat manifold. Again, you can be- 
gin to see why by considering a vector being parallel transported around different paths 
to the same point. You should convince yourself that the final vector will not depend on 
the path taken. 
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4.5.6 S^®R 



All of our previous examples have been in either one or two dimensions. While this 
is helpful to get some initial intuition it is not nearly general enough. We therefore con- 
clude by adding to the manifolds we have been considering one final three dimensional 
example. 

Consider the three dimensional space generated by "dragging" a sphere along the real 
line. In other words, at any point you could move in the two dimensional space of S'^ as 
well as the additional degree of freedom of M. We can think of this as the subspace in 
defined by the map: 

X = sin 9 cos (p 
y — sin 9 sin (p 
z — cos 9 

w ^ jjj (4.5.36) 

where 

9 e [0,7r) 
e [0,27r) 

^ e [0,7r) (4.5.37) 

This is essentially attaching an S'^ to every point on R between and tt. We can then 
compute the metric: 



ds^ = d9' + sin' 9d(f)' + dip 
'52 



= dsl^ + dijj^ (4.5.38) 



Before moving on to compute the remaining values, this metric shouldn't be a surprise 
- all we have done is add the single flat one dimensional degree of freedom. We would 
therefore expect the metric to have this form. Furthermore, because this has added only a 
flat degree of freedom, we don't expect the curvature to differ drastically from the sphere 
by itself. 

With that said, we can begin computing to see if we're right. The non-vanishing con- 
nection terms are: 

rj^ = -cos^sin^ (4.5.39) 
The non-vanishing Riemann tensor values are: 

R%4> = -R%e = sin^ 9 

RU = -Ree^=^ (4-5.40) 

The Ricci tensor is then 




Rij= sin' 9 (4.5.41) 
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And the curvature scalar is 



R 



(4.5.42) 



So, just as we suspected, the curvature is exactly the same as with S*^ by itself. However 
we can compute 




(4.5.43) 



To see what this means, consider a unit vector v in the 6 or (p directions (tangent to the 
sphere). This will give 



Gijv'v^ = 



(4.5.44) 



This is exactly what we found above in (4.5.22). A cross section of a vector in the 6 or 



direction will be a cross section of S"^, which is S^, which we know is flat. 
However, if v is a unit vector in the 'ip direction, we have 



GijvV 



1 



(4.5.45) 



And, from our definition of Gij (cf top line in equation ( 4.4.70[ )) we know that the "sec- 
tional curvature" we are interested in is 



2Gi 



R 



(2) 



(4.5.46) 



This is what we should expect. A cross section orthogonal to given vector in the ip direc- 
tion will simply be a copy of S*^, which has curvature 2. 

We could conclude here, but we will investigate manifolds of this type a bit more. 
One common type of problem in physics (namely general relativity) is that we have some 
information about Einstein's tensor and we want to know the metric. You are welcome 
to write out Gij entirely in terms of the metric, but you will find it to be an extraordi- 
narily complicated expression. Furthermore, you will find that, given Gij, solving for gij 
involves solving a non-linear partial differential equation that, in general, can't be solved. 



But there is still much we can do. Let's consider (4.5.43 1 to be the flat, or homogeneous 
case. In other words, if we define the matrix 



Go 




(4.5.47) 



Then, taking Gij to be a collection of first and second derivatives of gij, we interpret 

(4.5.48) 

to be a homogeneous differential equation for gij. The solutions will then obviously be 
(143381). 



Gij— Go 
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Now let's say that Gij ^ Gq, but rather there is some source term on the right hand 
side that is "driving" the geometr}|^ of S*^ ® M in the same way that a source term in 
Maxwell's equation "drives" the electric and magnetic fields. We will call this source term 
Tij. So, our inhomogeneous differential equation for g^j is now 



Gi 



T- 

-J- I-: 



(4.5.49) 



As we said above, this cannot be easily solved for arbitrary T^j. However, it is of- 
ten possible to make certain physical assumptions that can simplify the solution. For 
example, we might assume based on a symmetry argument that the sphere must not be 
deformed into an egg or a pancake, etc. Therefore any term on the rfs^a part of the metric 
must act out front as a radial term. Furthermore, we may assume that the source acts 
homogeneously throughout S"^, and therefore the source can only depend on ip. 

With these assumptions, we guess that the metric will have the form 



(4.5.50) 



where a{ip) is some unknown function. 

We can redo the calculations of the curvature tensors, and we will find that now 



G 



(l+a2)2 







V 








(l+a2)2 







\ 





(4.5.51) 



where the dot represents a derivative with respect to 4'- Also, noticing that a{ip) plays the 
role of the radius in (4.5.50), and in Gij here we see that for constant radius the sectional 
curvature (^33) is — -\, so the curvature is ^ (because of the factor of —2 in the definition 



of Gij). This agrees with what we had above in (4.5.21 1. 

Let's assume that a(ip) is a physically meaningful quantity, and we can use it to define 
the source term that is driving our geometry. In other words, we can write Tij in terms of 
a{'ijj). Let's then say that some sort of physical considerations tell us that the source term 
has the formFI 



(l+d2)2 









V 



(l+a2)2 





\ 





(4.5.52) 



So, by making the "physical" assumptions about the symmetry and homogeneity on 
S*^, we have traded the unsolvable differential equation for gij for a much easier set of 

''"recall that the topology 5^ (8) M can, via homeomorphism, have a huge array of metrics and therefore a 
huge array of geometries. In this sense the source term will change the geometry but not the topology, and 
the manifold will always be S'^ M. 

*^It may seem strange for us to introduce this mysterious "geometrical source term" without any expla- 
nation about where it comes from. For now just take for granted that some sort of physical reasoning can 
give it to you (at least in terms of a set of physical parameters like a{tp), and the equation 14.5.49 1 can be 
formed in a relatively straightforward way. 



276 



equations for a physical parameter a{ip). Reading from the above equations for Gij and 
Tij, we actually only have one unique equation: 

ad — —uP'a?' =^ a = —uP'a (4.5.53) 

which has solutions 

a(V') = A sin(a;V') + B cos(a;V') (4.5.54) 

The values of A and B can then be found via boundary conditions. For example, 5 = 
and u — \ gives 

ds^ = sin(V^)ds|2 + (4.5.55) 

This corresponds to starting with a singularity (an S"^ with zero radius) at ^/^ = 0, the 
sphere growing to have unit radius at = \, and then crunching back down to a singu- 
larity atijj = tt: 




4.6 Concluding Thoughts on Differential Geometry 

Before concluding this chapter we briefly discuss what we have done and what we 
still have to do. In this chapter we essentially added a single structure, the metric, to the 
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general topological manifolds M from the prior chapters. This structure did two things - 
it fixed the structure of the manifold so that we don't have the enormous homeomorphism 
redundancies discussed previously, and it greatly enhanced our ability to do calculus on 
M. 

One of the most important consequences of what the metric does is that it gives us a 
precise way of discussing curvature - the Riemann and Ricci tensors and the curvature 
scalar provide a powerful tool for probing a manifold, and the "sectional curvature" that 
the Einstein tensor provides will also be very useful. 

We have now spent nearly 300 pages doing math. You have likely noticed that there 
has hardly been a single word of physics. In the next few chapters, however, we will begin 
to use what we have learned so far to (finally) do some physics. As the title of this paper 
(and this series) indicates, our interest is ultimately particle physics. However, we will 
spend some time appearing to be distracted from that goal by studying electrodynamics, 
general relativity and cosmology. 

The reason for this is two-fold. The first reason is that the current status of high energy 
particle physics and cosmology leaves them, at times, intimately linked. String theory 
(our long term goal with this series), which is a theory of quantum gravity, is perhaps the 
most glaring example of this. Many of the most important and exciting aspects of string 
theory are ultimately cosmological, and vice versa. Therefore it will be necessary when 
studying string theory later to have some grasp of general relativity. 

The second reason is that our goal with this series is to provide not only an overview 
of the technical aspects of particle physics, but also provide some intuition. We have de- 
veloped a great deal of topology and geometry, but not nearly enough for string theory or 
even a respectable stab at gauge theories. And while the geometric and topological tools 
we will develop in order to study more advanced topics are extraordinarily powerful and 
elegant, is is very, very easy to completely lose sight of their meaning. No physicist should 
feel comfortable "plugging and chugging" with equations he or she doesn't understand. 

Furthermore, providing intuition with the geometrical and topological ideas we will 
be discussing later becomes increasingly difficult, especially when we make the jump 
to complex and algebraic geometry and look at the related topological concepts. There- 
fore, in order to ease the process of understanding the more advanced topics, we will 
invest time now to understanding the relatively simple ideas we have discussed thus far. 
General relativity in particular provides a very nice and (somewhat) straightforward ap- 
plication of the math thus far outlined. Understanding it should give a deeper intuition 
about how these ideas come into play in physics. 

As an indication of how this will help, recall that we said above that general relativity 
is fundamentally a theory of the Levi-Civita connection. Certain physical considerations 
demand that the connections be torsion free and metric compatible when studying grav- 
itation, and this greatly simplifies the situation. When we begin studying general gauge 
theories later, however, we won't have the luxury of only considering the Levi-Civita case 
- the connections there are much more general and can be much more complicated. If you 
don't have a good grasp of the Levi-Cevitia cormection, curvature, and how geometry 
affects physics, you will likely be lost when trying to make sense of the generalizations. 

So with that said, we will spend several chapters now doing physics before transition- 
ing back to math in the next paper in this series. We hope this is somewhat refreshing 
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after several hundred pages of math, and that it helps make the ideas developed so far 
more concrete. 



4.7 References and Further Reading 

The primary reference for this chapter was ^7\, though we also made extensive use 
of ||8l, |[TT|, |[28J, and most notably ||42|1. The derivation of the Ricci tensor and Ricci scalar 
was found in ||3l. For further reading we recommend IHI, 10, El, (13, lEl, lEl, lH, 

Ea, Bn, and ma. 
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Part II 



Physics 
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Chapter 5 
Electrodynamics 



The first physical topic we will discuss is electrodynamics. We do this for a variety of 
reasons. First of all, it provides a nice physical application of some of the mathematical 
ideas we have discussed, and ties them to something you should already be familiar with. 
Second, the ideas developed here will he helpful when we discuss general relativity and 
cosmology. Third, the underlying ideas of electrodynamics (believe it or not) play a huge 
role in string theory, and here is as good a time as any to introduce them. Fourth, the 
long term goal of this paper is to build a more thorough understanding of gauge theories, 
and as we saw in ||40| , electrodynamics is the simplest and most straightforward gauge 
theory. We will consider it in a fuller geometric context later, but introducing it now at an 
intermediate step will help the transition to the more complete picture. 

5.1 Formalisms of Electrodynamics 

The primary point of this section is not to necessarily provide any real mathematical 
or physical insight, but merely to illustrate how the mathematics we have developed 
previously (primarily differential forms and cohomology) can help clarify and generalize 
a physical theory. 

5.1.1 Electrodynamics and Vectors 
Maxwell's Equations 

Electrodynamics is based almost entirely on Maxwell's equations^] 



V xE 



V X B 



As usual we work in units where c = 1. 



+ 



V -B 

OB 
~dt 
V -E 

dE 
" ~dt 






P 
J 



(5.1.1) 
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where E are B are the electric and magnetic fields, respectively, J (current per area) is 
the source current, and p (charge per volume) is the electric charge density. The first two 
are the source free, or homogeneous, equations, and the last two are the inhomogeneous 
source equations. 



The Continuity Equation 

An important consequence of Maxwell's equations is the charge conservation law. Con- 
sider the last of (5.1.1 ), and take the divergence of both sides: 

dE _ _ - a 



V X B 



dt 



J 



V -V X B 

dp 



V ■ J + 



dt 



dt 
■ 



V ■E = V-J 



(5.1.2) 



where we used the third of (5.1.1 1 and the fact that the divergence of a curl always van- 
ishes to get the last line. This equation is the conservation equation which says that charge 
must be conserved. 



The Gauge Potentials 

In any introductory or graduate course on electromagnetism one usually introduces the 
scalar and vector potentials and A, and then the electric and magnetic fields are defined 
by 

E = ---V^ 

B = V xA (5.1.3) 

In the introductory courses this is motivated for two reasons: it often makes solving 
Maxwell's equations easier, and it essentially removes half of them. Consider the right 
hand side of the first homogeneous equation, written in terms of the scalar and vector 
potential: 

V -B = V • (V X 1) 

= (5.1.4) 

The final equalit y is an identity - the divergence of a curl vanishes automatically. So, with 



B defined as in (5.1.31 the first of Maxwell's theories are automatically satisfies - no more 
work need be done. 

Similarly for the second homogeneous equation: 

= -_(Vxl)-Vx V0+-(Vxi) 

= (5.1.5) 
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where the first and last term cancel and the second vanishes because the curl of a gradient 
automatically vanishes. 

So when E and B are written in terms of (p and A, the homogeneous Maxwell equa- 
tions are automatically satisfied. A course in electrodynamics then typically consists of 
learning about a billion tricks to solve the inhomogeneous equations for (p and A. 



Gauge Transformations 

Another important idea is that of a gauge transformation. For a given (p and A, we can 
make the transformations 

A — > A' = A + Vx (5.1.6) 



where x is an arbitrary (scalar) function. Plugging these new values into (5.1.3) gives 



-|(^%v,)-vw-|) 

E (5.1.7) 



and 



B' = V X A' 

= Vx{A + Vx) 

= V X A + V X Vx 

= V X 1 

= B (5.1.8) 

So in trying to solve Maxwell's equations for and A, we are free to make any gauge 
transformation we want to simplify finding solutions. 

Application of Stokes Theorem 

Finally, consider the first inhomogeneous Maxwell equation: 

V ■E = p (5.1.9) 
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We can use this to find the exact expression for the electric field p. Consider a very small 
charged particle with charge density p and total charge Q at the center of a three dimen- 



sional ball B^. We can integrate both sides of (5.1.9) through the ball to get the total 
charge: 



dW-E = / dVp (5.1.10) 

£3 Jb3 

where dV is the three dimensional volume element. The right hand side will clearly be 
the total charge Q. On the left hand side we can use the well-known divergence theoremj^ 



dW-E = dA-E (5.1.11) 

B3 JdsB^ 



where d^B^ is the boundary operator (taking B^ to the boundary, or 5^), (cf section 3.2.41 
and dA is the area element on the surface of the 5*^. We then assume that E depends only 
on the radial direction from the charged particle, and therefore it is constant across S'^. So, 



dA-E = E- dA 

8333 Jd^B'i 

= E{r){47rr^) (5.1.12) 

where we have assumed that E has only a radial direction (and hence depends only on r), 
and therefore the angle between E and dA is always (and cos(O) = 1), so we get simply 
the magnitude of E (the direction is understood to be in the r direction). So, finally. 



E{r){Anr^) = [ dVp = Q 



which is the standard expression for an electric field E{r) a distance r from a point charge 
Q. 

A similar calculation using the first of ( 5.1.1[ > indicates that there are no such things 



as magnetic charge^ How they might be incorporated into electrodynamics is not clear 
in the formalism we are currently presenting. We will see later how such a thing may be 
built in naturally. 

Furthermore, how might electromagnetism be generalized to an arbitrary manifold? 
We have been assuming in this section that these fields exist on M'^ with time as a param- 
eter. This is an extremely limiting constraint, but finding the correct generalization is not 
straightforward at this point. 

^As learned in any introductory calculus series. 
^in classical electrodynamics. 



284 



Electrodynamic Lagrangian 

As with any physical theory, we would ultimately like to put it in the form of an action]^ 
Starting with the energy of an electromagnetic fielc^ 

-{\E\^-\B\^) (5.1.14) 



V 



We can write this out in terms of and A getting 



(^) +(Wf + |V-^? + 2 



dA 



■ Vc 



(5.1.15) 



Then, including a source term — p0 + J ■ A, you can show (with a great deal of tedium) 
that variation of the Lagrangian 



C 



f + \V-A\' + 2^-V^-V\Af 

ot 



+ 



-J- A 



(5.1.16) 



will give Maxwell's equations. 

However, this approach is not particularly interesting (or useful), and we therefore 
won't pursue it. We mention it now merely to provide something to compare our later 
(more useful) results to. 

5.1.2 Electrodynamics and Tensors 

Maxwell's Equations 

One major difference between classical mechanics and classical electrodynamics is that 
classical mechanics is not an automatically (special) relativistic theory - electrodynamics 
is. Special relativity works not with 3 dimensional space with time as a parameter, but 
with 4 dimensional spacetime. It begins with the Minkowski metric ^ 

/-I 0\ 

10 

10 

\ 1/ 



(5.1.17) 



We then replace the spatial vector potential A and scalar potential with the single space- 
time 4-vector Potential 

A' 
A^ 



A^'= 



A 



(5.1.18) 



^Admittedly this approach isn't typically useful in classical electrodynamics. However doing so at this 
stage will make generalizing the action easier later. 

^This expression can be found in any introductory text on E&M. 

^While the Minkowski metric generally introduced in an introductory course on special relativity, it 
should have a much deeper and richer connotation after the previous chapter of these notes! 

^We will work with the convention that Greek indices run over all four spacetime components whereas 
Latin indices run only over spatial components. 
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(note that will be A^=(— 0, A)'^, where the minus sign in the first component of 7]^^^). It 
is then straightforward, though very tedious, to show that 



F^u — d^Ay — dyAfj_ 



where 



F = 



/o 


—Ex 


-Ey 


-eA 









-By 


Ey 







Bx 




By 


—Bx 


/ 



is the Field Strength Tensor. We can recover the original fields easily: 



Ei 



E 



Eoi 

doAi - diAo 

-^Ai-dii 
ot 



which is what we had abov^in (5.1.3 1. You can convince yourself that 

Bk = F,,e''' 

Now, consider the quantity d^Fyx- 

d^Fyx = d^{dyAx - dxAy) 
= d^dyAx - d^dxAy 

If we symmetrize this, we find 

d{^Fyx) = d^Fyx + duFxfj, + dxF^y 

= d^dyAx - d^dxAy + d„dxA^ - d^d^Ax + dxd^A^ - dxd^A^ 
= 



(5.1.19) 



(5.1.20) 



(5.1.21) 



(5.1.22) 



(5.1.23) 



(5.1.24) 



So what does this equation mean? You can try this for various components yourself to 
see that this gives 



dB, ^ dEy dEx ^ ^ 

dt dx dy 

dBy ^ dEx dE, ^ ^ 

dt dz dx 

dB, dE, dEy 



dt dy 



dz 



dBx dBy dB 







(5.1.25) 



dx dy ' dz 

^The reason for the minus sign on the time derivative in the third line is that d^,d^, d'^)^, and 

lowering the index puts a minus sign on the time derivative. 
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which are exactly the homogeneous Maxwell equations from (5.1.1 1. So once again, half 



of Maxwell's equations are satisfied automatically if we define things in terms of the po- 
tential A^. 

Writing the inhomogeneous Maxwell equations requires the introduction of the source 
4-vector 

J^={p,jf (5.1.26) 
Then, considering the quantity d^F^^ , we hav^ 

= d^d^'A" -d^A^" (5.1.27) 
Then, looking at the quantity d^F^^ = J^, plugging in different index values gives 

= P 
-- J. 



OF, 


^dFy 

dy 


dF, 

+ 

dx 


dx 


dBy 


dB, 


dF, 


dx 


dy 


dt 


dB^ 


dB, 


dFy 


dz 


dx 


dt 


dB, 


dBy 


dF, 


dy 


dz 


dt 



Jy 



J, (5.1.28) 



which are exactly the inhomogeneous Maxwell equations from ( 5.1. 1[ ) 



So, in terms of the field strength F^y, Maxwell's equations can be rewritten as 

di^^F.x) = 

dyF"'' = (5.1.29) 



However, because of the definition of the field strength ( 5.1.19[ >, the first of these is merely 



an identity and contributes nothing to the physics. In this sense, electrodynamics in terms 
of the F and 5 is a theory with four fundamental vector equations, whereas electrody- 
namics in terms of F^;, (which is written in terms of A^j) is a theory with one fundamental 
tensor equation. 

The Continuity Equation 

Once again, we can start with the inhomogeneous Maxwell equation and take the deriva- 
tive of both sides: 

d^d^F'^" = d^r (5.1.30) 

The right hand side is a symmetric sum over an antisymmetric tensor and therefore van- 
ishes, leaving 

d^J^" = (5.1.31) 

^The field strength i^^" with upper indices is the field strength Ff^i, with lowered indices raised by the 
metric r/^^, and therefore there will be minus signs to keep track of. 
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Writing this out: 

d^J^ = -dop + diJ, + d2J2 + dsJs 

= (5.1.32) 
You can see that it is simply the charge conservation equation (5.1.2[>*^ 



Gauge Transformations 

Next we discuss gauge transformations. Again, doesn't uniquely define a given F^i^. 
Any transformation of the form 

A^^A'^ = A, + d,x (5.1.33) 

leaves F^^, unchanged: 

Pfii^ = 9fiAj^ — dyA^ 

= ^^{A, + ^,x)-^M^^ + ^^^x) 

= d^A^ - d^A^, + d^,d^x - d^d^x 

= d^Ai, — dyA^ 

= F^y (5.1.34) 



Notice that the form of Maxwell's equations in (5.1.11 are specific to three plus one 



dimensional space. However, the form in (5.1.291 are not. For this reason we take ( 5.1.29[ > 



to define Maxwell theory in general (regardless of the manifold). 

With that said we are able to glean some insights into Maxwell theory in dimensions 
not equal to 3 + 1. For example, consider 2 + 1 dimensions. F^y will now be a 3 x 3 matrix. 



We can find it by simply truncating (5.1.20 1 to 






—Ex 


-E, 


Ex 





B 


Ey 


-B 






F.,= \ExQ B \ (5.1.35) 



where we removed the subscripts on B because it is now a scalar. So, interestingly, the 
magnetic field is not a vector in 2 + 1 dimensions - it is a scalar. The electric field, however, 
is still a vector. 

In 4 + 1 dimensions, it is not clear how to generalize the magnetic part. But what is 
clear is that the electric field will still be a vector, and the magnetic field will be a 4 x 4 
antisymmetric tensor - not a vector (or a scalar). 

Continuing with this, in n+ 1 spacetime dimensions, E will always be an n-dimensional 
vector, but the magnetic field will always be an n x ?i antisymmetric tensorp] 

^"Don't worry about the sign difference - it is simply part of how the values are defined. 

^^An antisymmetric 2x2 tensor (as in the 2 + 1 dimensional case) has only one degree of freedom which 
is why it was a scalar. An antisymmetric 3x3 tensor (as in our familiar 3+1 dimensions) has three degrees 
of freedom, which is why we see it as a vector. In general an antisymmetric n x n tensor has — 1) 

independent components. 
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The formalism of this section is much neater than the formalism of the previous sec- 
tion. The most important aspect is that this formalism is manifestly relativistic. Another 
nice aspect is that it it valid in arbitrary dimensions. 

However we still have the shortcoming that it isn't always obvious how to do electro- 
dynamics on an arbitrary manifold. The next section will provide a powerful formalism 
that allows for electrodynamics to be done on any manifold. 



Electrodynamic Lagrangian 

Writing out a Lagrangian in this manifestly relativistic formulation is much easier. As 
discussed in BOH , the Lagrangian that produces Maxwell's equations (and is equivalent 
to (|5TT6l)) 



(5.1.36) 



will give Maxwell's equations ( 5.1.29[ (^ 



We don't need to pursue this any more - we merely wanted to illustrate the vast im- 
provement this notation offers over the vector notation (as in (5.1.16 1). 

5.1.3 Electrodynamics and Forms 

Maxwell's Equations 

The frequent reference to antisymmetric tensors in the previous section should imply that 
we can use differential forms for what we are doing. 
To begin with, consider the 1-form 



{1) 



A^dxf" 



(5.1.37) 



Consider taking the exterior derivative (cf section 2.3.71 of this: 



d^A^dx'' A dx" 



;((9^A^ - duA^)dx'' A dx" 



(5.1.38) 



where we have defined the two form 



(2) 



F^^dx'' A dx"- 



(5.1.39) 



which is, of course, an antisymmetric tensor. Clearly this identical to the previous two 
sections, where A(j) is the potential and F(g) is the field strength, with 



d^Ay — dyA^ 



(5.1.40) 



12 



Again, see the first chapter lllOl for how this is done. 
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What is so nice about this is that the form of the exterior derivative exactly captures the 
form of the field strength from before (compare (5.1.38 1 to (5.1.3 1 and ( 5.1.19| >) 
Identifying F(2) as the field strength, we can make the identificatiorj^^] 



(2) 



Eidx^ A dx^ + E2dx^ A dx° + E^idx^ A dx^ 
-Bidx^ A dx^ - B2dx^ A dx^ - B^dx^ A dx^ 



(5.1.41) 



You are encouraged to show that this reproduces the results in (5.1.21 ) and (5.1.22). 
Next, because of the nilpotency of the exterior derivative (2.3.73) we have 

c?F(2) = d{dA^i)) = d^A(,) = 

Writing this out in components gives 

dF(^2) 

'id^F.x + d.Fxf, + dxF^,)dx^ A dx" A dx^ 



(5.1.42) 



df^Fyxdx^ A dx" A dx 
1 

-^[c)^lFu\ + (Jui'XfM 




(5.1.43) 



Comparing this to (5.1.24 1 we see that they are identical. So the nilpotency of d has given 
us the homogeneous Maxwell equations! You can write this out in component form to 
show that they reproduce (5.1.1 ). 

We now want to write the inhomogeneous Maxwell equations. There is, however, a 
difficulty - they will involve derivatives of F(2), but the only derivative we have to work 
with with differential forms is the exterior derivative, and because of its nilpotency we 
can't use it on F(g). The solution to this problem lies in the Hodge star (cf section 4.3.1 1. 

Let's begin by taking the Hodge star of F(2): 



*F 



(2) 



2!(4-2 
1 



;Ff,ueZdx^ A dx' 



F^,rixrV.ve'"'^''dx^ A dx'' 

Eidx"^ A dx^ + E2dx^ A dx^ + E^dx^ A c/x^ 
-Bidx^ A dx° - B2dx^ A dx^ - B^dx^ A dx^ 



(5.1.44) 



13- 



We are switching from t, x, y, z to a;", x^,x^,x^ for generality. 



290 



Then we can take the exterior derivative of this: 



diEidx^ A dx'^ A dx^ + d^Eidx^ A dx"^ A dx"^ 
+d2E2dx^ A dx^ A dx^ + doE2dx° A dx^ A c/x^ 
+d3E3dx^ A rfx^ A dx^ + d^E^dx^ A rfx^ A dx^ 
-d2Bidx^ A (ix^ A (ix° - d^Bidx^ A (ix^ A (ix° 
-diB2dx^ A (ix^ A rfx° - ^s^arfx^ A dx^ A rfx° 
-diB^dx^ A dx^ A (ix° - d2B'j,dx^ A rfx^ A dx^ 
{diEi + (92^2 + d3E3)dx^ A rfx^ A rfx^ 
+{doE3 + d2Bi - diB2)dx° A dx^ A dx^ 
+((9o^2 + diBs - d3Bi)dx° A dx^ A dx^ 
+(9o^i + ^352 - d2B3)dx° A dx^ A dx^ 



Comparing each of these lines to (5.1.28) we see that if we define 

J(;) = pdx° + Jidx^ + Jgdx^ + Jsdx^ 

then 

*J(_;) = pdx^ A dx'^ A dx^ + Jidx^ A dx"^ A dx^ 
J2(ix° A dx^ A dx^ + J-^dx^ A dx^ A dx^ 

Then, finally, the equation 

d^ F(g) = 

represents the inhomogeneous Maxwell equations. 
The Continuity Equation 



Taking the exterior derivative of both sides of (5.1.48) we have 

d^ F(2) = d-k 3(^1) 



(5.1.45) 
(5.1.46) 



(5.1.47) 
(5.1.48) 



(5.1.49) 



The left hand side vanishes due to nilpotency, and we can write out the right hand side: 



di^J 



(1) 



{dop - diJi - 92J2 - d3J3)dx° A dx^ A dx"^ A dx^ 




(5.1.50) 



Comparing this to ( 5.1.32[ ) we see that, as expected, charge is conserved. 



Gauge Transformations 

Next we define a gauge transformation. This is made particularly easy by the nilpotency 
of the exterior derivative. We leave ^{2) unchanged by the addition of the exterior deriva- 
tive of any arbitrary exact 0-form x- 



A(^) — > A(^) = A(i) + dx{o) 



(5.1.51) 
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Then, almost trivially. 



(2) 



(i(A(j) + dx(o)) 
dA^) + d^Xio) 
dA^) 



(2) 



(5.1.52) 



Application of Stokes Theorem 



Finally, consider again the derivation of the strength of the electric field a distance r from 
a point charge Q from section 5.1.1[ We commented there that the version of Gauss' Law 
we were using isn't generalized to arbitrary dimension in an obvious way. Now that we 
are working in terms of forms this generalization becomes extremely easy. Specifically 
we can define the charge in a region 5" (an n-dimensional Ball) as the integral over the 
source term J (5): 



Q 



d^F 



(2) 



(2) 



(5.1.53) 



where we have used Stokes theorem (3.3.58 1 



Then, noting from (5.1.44) that the electric part of the two form ^F(2) is the purely 



spatial 2-form (across rfx* A dx^ where i,j 7^ 0), which we interpret to mean the spatial 
n — 1 sphere S*"^^ which forms the boundary of B^, or dB^. Then, by the analogous 
argument as in section 



5.1.1 we can assume that at a constant radius r around the center 



of B" the electric field is equal, and therefore we can bring it out, leaving only an integral 
across the boundary 95". So, finally, 

Q = E{r) I dV (5.1.54) 

where the integral simply gives the volume of the sphere S""^^ of radius r, which i^ 

27rt 

V{S^-') = f^^r--' (5.1.55) 

So, 



E(r) 



r(i) Q 



r\ — 

2tt2 r 



n-1 



(5.1.56) 



14 



Again, you can find this in any text on basic geometry. 
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Electrodynamic Lagrangian 

Finally we can write out the Lagrangian in terms of forms. The Lagrangian is defined as 
the integrand of the action, which is (in a relativistic formalism) an integral over all space- 
time. Therefore we should seek a well-defined volume form. In light of the discussion 
surrounding (4.3.40) and (4.3.431, we guess that the kinetic term 

CKin oc F(2) A ^F(2) (5.1.57) 

which we know will be a well defined volume form, is the best guess. Writing this out in 
Minkowski space. 



F(2) A ^F(2) 



1 



{F^^dx^" A dx") A {F^pe'^^dx'^ A dx') 



Ff,uFa(3e<:cofS9^"9'"^e^"'^^{dx° A dx^ A dx^ A dx^) 



^Ff,^F'^'^e^^^se'"'^\dx^ A dx^ A dx^ A dx'^ 
^F^^F^^^idx^ A dx^ A dx^ A dx^) 



(5.1.58) 



Comparing this with ( 5.1.36[ > we see that the appropriate kinetic term for the electromag- 
netic field is 



C 



Kin 



(2) 



(2) 



(5.1.59) 



For the source term, we know from ( 5.1.36[ > that the source J(^) must somehow couple 
to the potential form A(j). These are both 1 -forms, so we again guess that the source term 
should be of the form 



(^Source OC A(_?) A 7^rJ(_;) 



(5.1.60) 



(recall from (4.3.421 that A(;) A = 3(i) A *A(;)). Writing this out gives 

1 



A(i) AVcJ(;) 



g (A^rfx^) A ( J.e^^^da;" A dx" A dx"') 

-AaJuC.B-yidx^ A rfx" A dx^ A dx'^) 
6 ' 

J^e5„^^^'''^e^"^^dx° A dx^ A dx^ A dx"^) 
A^Jf'dx^ A dx^ A dx^ A dx^ 



(5.1.61) 



So the appropriate Lagrangian to agree with (5.1.36 1 will be 

1, 



C 



(2) 



A*F 



(2) 



A(i) A 7lrJ(2) 



(5.1.62) 



293 



5.1.4 Summary of Electrodynamic Formalisms 



We can summarize everything we have done in the past three sections as follow: 





Classical 


Tensor 


Forms 


Maxwell 


V ■ B = 

V -E = p 

V X 5 - — = / 

VAX? J 


d^f^E.x) = 
^^F^''' = J^' 


d^F = 3 


Potential 


^ - at 
B = V X A 


F^u = d^Ay — duA^ 


F = dA 


Gauge Trans. 


A^ A + Vx 


A^^ A^ + df,x 


A^ A + dx 


Stokes 


J^,dVV-E = j,,^^,dA-E 


Same 


{dc\uj) = {c\duj) 


Lagrangian 


very ugly (5.1.161 




-If Ai.F- AAi.3 



The advantages in the use of forms is very obvious here, not only in its efficiency, but 
also in the fact that they provide an index free formulation that is completely indepen- 
dent of the manifold these fields exist on. This will give us the ability to consider, in the 
next few sections, how electromagnetic fields might behave on a topologically non-trivial 
manifold. 



5.2 The Aharonov-Bohm Effect 

The Aharonov-Bohm affect provides a very nice illustration of marriage of mathemat- 
ics and physics. Our presentation of it will be brief, but it will provide a nice illustration 
of how the formalism developed thus far can be used. 

The classical Lorentz force law on a particle of electric charge q is 

F = q{E + vxB) (5.2.1) 

— * — * 

Therefore, if both the E and B fields are both zero, the particle experiences no electro- 
magnetic force. 

Furthermore, as mentioned above, we can write the electri c and magnetic fields in 



terms of the vector and scalar potentials A and as (cf equation (5.1.3 1) 

^ dA „^ 

E = ---V^ 

B = V X A (5.2.2) 

And, as we mentioned above, the vector and scalar potentials are not unique. Under a 
gauge transformation (cf ( 5.1.6[ >) 

A — ^ A' = A + Vx (5.2.3) 
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the electric and magnetic fields remain unchanged (cf (5.1.71 and (5.1.81): 



E — > E 

B — > B (5.2.4) 

So, classically speaking, because it is only the E and B fields that show up in (5.2.1 1, it is 
only E and B that are physically measurable. The potentials, which don't uniquely define 
E and B, are a nice mathematical tool to help simplify solving Maxwell's equations, but 
they aren't considered to be truly physical fields. 

Furthermore, we know from [40J that the gauge symmetry in classical electromag- 
netism is a remnant of the deeper U{1) symmetry in the field Lagrangians. The gauge 
transformation for fields i}) and ijj is 

ijj — i> e'^V 

ij — > tPe-'"" (5.2.5) 

where e~*" is an arbitrary element of And because in quantum mechanics the only 

physically measurable quantity is the amplitude, 

— > ^e-'^e'"^ = tp^p (5.2.6) 

it appears that the gauge potentials are indeed not physically measurable quantities - they 
are nothing more than phases. 

However, this turns out not to be the case. Consider some region of n dimensional 
space U. We first take U to be topologically trivial (say, U = M"). Because U is topologically 
trivial, all cohomology groups (greater than 0) will be trivial: 

H"'{K) = V m > (5.2.7) 

Recall from sections 3.3.3 3.3.5 that the definition of the m*'' cohomology group H^{U) is 



H'^iU) = (5.2.8) 

where is the set of all closed m forms on U (m forms cUm satisfying dcum = 0) and 

B'^ilA) is the set of all exact m forms on U (m forms ujm that can be written as the exterior 
derivative of an m — 1 form am-i- So uJm = dam^i). 



Recall our discussion from section 3.3.5 There we discussed on a factor group G / H 



creates an equivalence class of elements of G where two elements of G are said to be 



equivalent if their difference is in H. So, in the definition of a cohomology group (5.2.81, 
two closed forms are considered "equivalent" if their difference is an exact form. In other 
words, two closed m forms (3m and 7^ are equivalent, or "cohomologous", if their differ- 
ence is exact: 

/5m - 7m = dam-1 (5.2.9) 

for some m — 1 form am — 1. We can restate this as saying that Pm and 7m are cohomolo- 
gous if one can be expressed in terms of the other plus an exact form: 

Pm = 7m + dam-1 (5.2.10) 
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But, as we've said, the cohomology groups for a topologically trivial space like W = M" 
are trivial (equal to zero, the identity element, cf equation ( 5.2.7[ >), we know that all closed 
forms must be exact: 

= (5.2.11) 



and so for any two m forms /3m and 7™ we can find a Pm-i to satisfy equation (5.2.101. In 
other words, all m forms are cohomologous to each other in that any two m forms can be 
related by the addition of an exact form. 

Now let's translate this into electromagnetism in U. Consider some potential 1 form 
A and field strength 2 form 

F = dA (5.2.12) 
defined in W. As discussed above, we can perform a gauge transformation to A (equation 



(5.1.51 )) that leaves the field strength unchanged (equation (5.1.521): 



A — yA' = A + dx — > F — ^F' = F (5.2.13) 

And because the space these fields live in (U) is topologically trivial and therefore has 
trivial non-zero cohomology groups, we can consider any two potentials we want (A and 
A') and they will be cohomologous - there will be a gauge transformation that relates 
them to each other. 

So, consider letting F = in all of U. This means that 

F = dA= (5.2.14) 

We know that it must be a constant (there is no solution to dA = on all of other 
than A = a constant. And if A is a constant on all of U, the field at every point will have 
the same phase, and therefore equation ( 5.2. 6[ > indicates that there is nothing physically 



measurable about the potential. 

Now, however, let's let U be topologically non-trivial. This may mean that, for exam- 
ple, 

H\U) y^O (5.2.15) 
This means that when the field strength is zero (F = 0), there may be a solution to dA = 



for A that is not a constant (cf example starting on page 151 1. And for a non-zero gauge 
field that is not a constant, the phase may be different at different points. 

An experiment has been set up involving sending particles through a double slit as 
usual, except behind the double slit, between the slit and the screen, there is an extremely 
thin solenoid (a wire parallel to the slits that runs through the area the electrons are mov- 
ing through) with a non-zero field inside the solenoid, but a zero field outside it. The 
electrons are shielded from the interior of the solenoid. 
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Electron Gun 




When the field inside the solenoid is zero, the field and the potential outside the solenoid 
is zero. However, when the field inside the solenoid is non-zero, the field outside the 
solenoid is still zero but the potential outside the solenoid is non-zero. The effect of this 
is that, with the solenoid field non-zero, the space becomes topologically non-trivial. It 
is M'^ — M where the M is the one dimensional space the solenoid is running throughpj 
So, because the topology is non-trivial, we can have non-constant solutions to dA = 0, 
and therefore the phase of the field on each side of the solenoid need not be the same. So 



(5.2.6) would now be 

— > ^e-'^'e^'^'il) = e'^"'-°'^'4) (5.2.16) 

This experiment has been carried out, and it has been observed that the interference 
pattern (which depends on the phases of the electrons) does in fact shift when the field 
inside the solenoid is turned on. This affect is called the Aharonov-Bohm Effect. The 
point of the effect is that it demonstrates that at the quantum level it is the potentials, not 
the field strenghs, that are truly fundamental. We will see this point made in much greater 
force in the next paper in this series, but for now what we have said will suffice. 

There is vastly more we could say about the Aharonov-Bohm effect and related ideas, 
and anyone reading who is more familiar with the topic will likely be annoyed that we 
haven't said more. However, once we have the necessary machinery (in the next paper in 
this series), we will return to this topic in much greater detail. 



15' 



The necessity that the solenoid be extremely thin is the major challenge in carrying out this experiment. 
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5.3 References and Further Reading 

The formalisms for electrodynamics can be found in almost any of the differential 
geometry texts listed in chapter |4j We primarily used [19,1 and [|3J, though ||T9l and L45J 
are also very good. 

For further reading on the various applications of geometry and topology to electro- 
magnetism, there are very interesting ideas in |2l, [|23i , |39| . 
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Chapter 6 
Gravity 



It may seem strange to include a chapter on general relativity, Einstein's (classical) theory 
of gravitation, in a series on (quantum) particle physics. But as we mentioned earlier 
there are several reasons it is an appropriate topic at this point. First of all we want to 
understand particle physics as comprehensively as possible, and a quantum theory of 
gravity is a necessary part of that. When we eventually get to the leading candidate 
for quantum gravity, string theory, it will be necessary to understand the theory we are 
trying to quantize. The second major reason to discuss general relativity now is that it is 
a very good way to illustrate the geometrical ideas we have discussed so far in this paper. 
Providing some intuition with what we have done will be helpful when we discuss much 
more complicated geometrical and topological ideas later. 

With that said, we will approach general relativity as follows. We will begin by dis- 
cussing Newtonian mechanics in the more geometrical language we have been using. 
Then we will discuss special relativit}|^ in geometrical language. Then we will discuss 
what lead Einstein to make the leap from special relativity to general relativity, and what 
the consequences his theory are. Therefore, we begin with Newton. 



6.1 Newtonian Geometry 

Simply put, classical, or Newtonian mechanics is geometrically very boring - every- 
thing takes place on with Euclidian metric gij = Sij. Because we have a metric, we can 



define the "distance" between two arbitrary points (cf section 4.1.3): 

ds^ = dx'^ + dy^ + dz^ (6.1.1) 

We take this distance to be an intrinsic part of the space, and we know that it should not 
change if different coordinates are chosen. For example an observer using Cartesian coor- 
dinates and an observer using spherical coordinates should not report different distances 
between two points. Nor should two observers standing in different locations. 

Furthermore, notice that the Newtonian geometry looks only at spatial dimensions. 
Time is treated as a separate parameter, not on the same geometrical footing as space. 
This is why Newtonian physics takes place in three dimensions, not four. 



^We assume you have some familiarity with special relativity 
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Also, because 

d6. 



all of the connection coefficients vanish: 



(6.1.2) 



r}, = (6.1.3) 
and therefore (obviously) this space is flat. This also means that we can write out the 



geodesic equations very easily (equation (4.2.69)): 

(Px"^ 



(6.1.4) 



which is solved to give straight lines: 

x\t) = CiT + Ci (6.1.5) 



where C* are constants of integration depending on boundary conditions. Equation (6.1.5[> 



is clearly the equation of a line and therefore we have proven the well known fact that 
"the shortest distance between two points is a straight line" (in with gij = Sij). 

So at any point we can define a set of geodesies through that point (any solution to 



(6.1.5) that passes through the point), and because we know that there is no curvature, 
we can take the vector corresponding to any geodesic and parallel transport it to any 
other point in the space. Because the curvature vanishes the vector at the new point will 
be independent of the path taken and therefore there is a unique vector at each point. So 
imagine starting with a set of 3 basis vectors at some point. We can parallel transport each 
of these three vectors to every other point in M'^, thus defining a rigid coordinate system 
at every point. 

Furthermore, the fact that all particles move in geodesies through M^, which are 
straight lines, is exactly equivalent to Newton's first law - objects in motion stay in motion 
unless acted on by an external force. This is entirely contained by the statement "in the 
absence of external forces objects follow geodesies". 

This notion of a "rigid coordinate system at every point" is at the heart of Newton's 
view of space. This is also what motivates the classical Galilean transformations between 
two observers moving relative to each other: 

t' = t 

x' = X — vt 

y' = y 

z = z (6.1.6) 

where the observer in the primed frame is moving in the x direction with velocity v and 
their origins coincide at time t = 0. Both observers have identical notions of time and 
clearly the distance between two points (xi, yi, zi) and (0:2, 2/2, ^2) will be the same: 

^{x',-x'^Y + {y[-y',y + {z[-z'^^ = ^((^1 - vt) - {x, - vt)Y + {y, - y^Y + {z, - z^f 

= - X2f + (yi - y2? + (^i - z^f (6.1.7) 
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In summary, the geometry of Newton (M^ with metric 5ij) has profound implications 
- the fact that it is flat allows for a unique set of "parallel" vectors across the entire space, 
creating a fixed, rigid, unmoving, un-dynamical background, as well as define the straight 
line geodesies all particles travel along. As a result, all observers will measure the same 
time no matter what they are doing, and the distance they measure between two points 
will always be the same regardless of what they are doing. 

We now look to see what geometric changes Einstein made in jumping from Newto- 
nian mechanics to special relativity. 



6.2 Minkowski Geometry 

In many ways, special relativity is the clearest demonstration of the true brilliance of 
Einstein. General relativity may demand more mathematical acrobatics, but it was the 
geometrical leap made in moving from Newton's view of space and time to the single 
continuous notion of spacetime that was truly demonstrated Einstein's genius. The funda- 
mental idea of this paradigm shift is that, rather than all physics being carried out on M'^ 
with metric 5ij, physics occurs on M"', but rather than with the Euclidian metric Sij, instead 
with the Lorentz metric 



Vij 
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(6.2.1) 



The most obvious result of this is that time is now treated as a dimension in the geometry 
of the universe, rather than merely a parameter. 

Another equally important consequence is that the simple Galilean transformations 



(6.1.6) must be modified. The reason for this is that it is no longer merely the spatial 
distance between two locations ( 6.1.1[ > that is preserved, but rather the spacetime distance 
between two events^ 



r]ijdx^dx^ = —dt^ + dx'^ + dy^ + dz^ 



(6.2.2) 



This space is called Minkowsi space. The set of transformations which leave this interval 
unchanged is no longer ( 6.1.6[ ) but the Lorentz transformaions: 



t' 
x' 

y' 

z 



t 



vx 



X — vt 

y 

z 



(6.2.3) 



where again the observer in the primed frame is moving in the x-direction with velocity 
V and their origins coincide at t = 0. 



2 We will be taking c = 1 in everything that follows. 



301 



Notice that with the metric defined as above, it is possible for a vector to have positive, 
negative, or even zero norm. For example, a unit vector in the time direction but no 
component in a spatial component: v = (i, 0,0,0)^, will have norm 

r^.-v'v^ = -1 (6.2.4) 

On the other hand, a vector with no time component but a spatial component, for example 
V = (0, 1 , 0, 0)^ , will have norm 

r]ijv'v^ = 1 (6.2.5) 

And finally, consider the vector v = (1,1,0,0)^. It will have norm 

r]--vV = -1 + 1 = (6.2.6) 

We call vectors with negative norm timelike, vectors with positive norm spacelike, and 
vectors with zero norm lightlike or null. 

The geodesic equations are again very simple: 

which has solutions 

x\t) = ClT + Ci (6.2.8) 

So the geodesies are again straight lines in W^. 

At any point p G we can choose a family of geodesies passing through p, each of 
which will correspond to a vector that is either timelike, spacelike, or lightlike. Notice 
that if an observer is on a timelike geodesic, there does not exist a continuous Lorentz 
transformation that will transform him to a spacelike or timelike geodesic, in fact, it is 
not possible to use a continuous Lorentz transformation to transform between any two of 
the three types of vectors. We therefore consider the manifold at any given point and 
vector to be divided into three distinct sections - the future light cone, the past light cone, 
and "elsewhere"]^ This leads to the well known result that all material particles travel 
along geodesies that are everywhere timelike (negative norm) and all massless particles 
travel on null, or lightlike (zero norm) geodesies. 

For an arbitrary timelike geodesic x*(r) = C{r + we can find the velocity vector 

y^='^ = Cl (6.2.9) 
dr 

with norm 

7],,W = r/,,^^ = fc (6.2.10) 

^Any introductory text on special relativity will contain numerous helpful illustrations of all of this. We 
omit them both for brevity and because we are assuming some knowledge of special relativity. 
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where by assumption A; < (because we have assumed that the geodesic is timelike). If 
we introduce a new parameter X — X{t) such that 



then 



dX 

dr 



dx^ dx^ f dX\'^ dx^ dx^ , dx^ dx^ 



(6.2.11) 



= U j dA dA = ^ ^ = ' ^^-'-^'^ 

So, when parameterized with A this timelike vector has unit length. We call this parameter 
the proper time parameter. Physically it is the time as measured by an observer in the 
same inertial frame as the observer on the geodesic x\ 
Furthermore, we have 

Pi = (6.2.13) 

dx'' 

and therefore the connection vanishes, as does the curvature. This means that for a given 
vector at a given point, we can parallel transport this vector to any other point in 
resulting in a unique vector at the new point. For a given observer in some location with 
some velocity, the set of all observers in any location with parallel velocity are considered 
to be in the same inertial frame. These inertial frames are, as you are likely familiar, a 
central theme of special relativity. The consequence of these ideas is that there is no such 
thing as an absolute notion of simultaneity, no absolute notion of "motionless", etc. Two 
things can be said to be simultaneous, or motionless, relative to a given inertial frame, but 
nothing more. 

Of course, the construction of special relativity has the shortcoming of being unable 
to discuss non-inertial, or accelerating frames. This is the domain of general relativity. 
Before discussing general relativity, however, we discuss one final aspect of special rel- 
ativity - the energy momentum tensor. Because almost no book adequately explains the 
meaning of the energy momentum tensor, we will build it up in steps, starting with the 
non-relativistic "stress tensor" and generalizing from there. 



6.3 The Energy-Momentum Tensor 

6.3.1 The Stress Tensor 

Consider some infinitesimal volume of material in three dimensional space which for 
simplicity we will take to be a rectangle. We take the volume of this rectangle to be dV. 
Because the boundary of the rectangle (a rectangular box) is homeomorphic to S^, we can 
take each face of the rectangle to have an oriented area element: 
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A 



\ 

n 




We will choose the convention illustrated where the positive oriented area unit vector n 
is outward from the volume. 

There are two types of forces that can act on dV - volume forces and surface forces. 
Volume forces are forces that act on the entire body at once and are therefore proportional 
to dV . Gravity and electromagnetism are examples of volume forces. The gravitational 
force on this infinitesimal volume will be 

dFgrav = PmgdV (6.3.1) 

where pm is the volume mass density, and the electric force is 

dFeiec = PeEdV (6.3.2) 

where pe is the volume charge density. Volume forces generally very familiar and we will 
therefore not focus on them. 

Surface forces are forces that act on a particular surface of the volume. The most 
common type of surface force is pressure, or the force per unit area. The force due to 
pressure is then proportional to the area element on which the pressure is applied: 

d-^pressure ^ dJ\. (6.3.3) 

Furthermore, there are two types of surface forces - pressure and shear. Pressure is a 
force with components perpendicular to the surface: 
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(we include both positive and negative pressure for clarity). Shear force is a force with 
components perpendicular to the surface: 
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Obviously in three dimensions the pressure force spans only a single dimension (parallel 
to n) whereas the shear force spans a two dimensional space. 

Any surface force is proportional to the area on which the force acts. We therefore 
consider the ratio of the force to the area, which we call the Stress. There are two types of 
stresses - the pressure stress and the shearing stress. The pressure stress is 

Fp 

stress = = P (6.3.4) 

where P is the pressure and Fp is the pressure force, and the shearing stress is 

Fs 

stress — — (6.3.5) 

where Fs is the shearing force. We can use these relationships to discuss the force per 
area, or in other words the force on a small area dA with unit normal n. We will use the 
notation 

dA = ndA (6.3.6) 

Our goal is to write the force on the infinitesimal piece dF as a function of a given volume 
element, so we are looking for a function dF = dF{dA). 

To discover the form of this function, consider three area elements forming a triangle 
at any arbitrary point in dV: 
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Because this is a closed triangle, we can immediately write down the relation 

dAi + (iAg + = 



(6.3.7) 



Also, from both the general form of (6.3.4) and (6.3.51, we can deduce that dF{dA) is 
proportional to the area element dA (cf (6.3.3 1) and therefore 

dF{adA) = adF{dA) (6.3.8) 

where a is some arbitrary constant. 

Finally, consider the net force on this triangle from Newton's second law: 

dF{dAi) + dFidAs) + dF{dA3) = (i(ma) (6.3.9) 

Now consider an identical triangle but scale it by a factor of A in all three dimensions. The 
three surface terms on the left are proportional to area and therefore they will scale by a 
factor of A^. However the mass term is proportional to dV and will therefore scale by A'^. 
So, the net force on this scaled triangle will be 



X\dF{dAi) + dF{dA2) + dF{dA3)) = A^d(ma) 



(6.3.10) 
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which simplifies to 



dF{dAi) + dF{dA2) + dFidAg) = Ad(ma) 



(6.3.11) 



But because this must be valid for any value of A, we must have a = 0. So, we have the 
remarkable property that 



dF{dAi) + dF{dA2) + dF{dA3) 
By Newton's third law we know that 

dF{-dA) = -dF{dA) 







(6.3.12) 



(6.3.13) 



And combining (6.3.71, (6.3.121, and (6.3.131, we have 

dFidAi + dA2) 



dFi-dAs) 
-dFidAg) 
dF{dAi) + dFidAs) 



(6.3.14) 



Combining this with (6.3.8 1 we have that dF{dA) is a linear function in dA. We therefore 
write the components of dF in terms of the components of dA as follows: 



dF\dA) = aUA^ = aWdA 



(6.3.15) 



(where j is obviously summed) where cr] is some 3x3 matrix. The matrix a] is called the 
Stress Tensor of the volume. 

The meaning of the stress tensor is straightforward from the definition ( 6.3. 15[ ). 
Namely, at any point in dV , the force applied to an area element ndA is given by the 
stress tensor via (6.3.151. So, if you know the stress tensor for a medium, you can choose 
any arbitrary area element anywhere in the medium, and the product of the stress tensor 
and the area element unit vector will be the force acting on that area element. 

To get a better feel for this tensor, consider a unit area element pointing in the positive 
X direction: 



dA = dA 



(6.3.16) 



This makes (6.3.15) very simple - the x component of the force on this area that is normal 
to is: 



dF\dA) = a\dA 



Or inverting this. 



0"i 



dF^ 
~dA 



(6.3.17) 



(6.3.18) 



which is the pressure stress in the x direction. 

Similarly, we can see that a] for any i is the pressure stress in the i*^ direction. 
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Consider again dA — dA, but now consider the y component of the force on this 
area that is normal to is: 



dF\dA) = afdA (6.3.19) 

Inverting this, 

al ^ ^ (6.3.20) 

which is the shear stress in the y direction on the area element. 

More generally, cr] is the shear stress in the i*'* direction on an area element normal to 
the direction. 

As a final comment, consider the net torque of some volume of radius r around, say, 
the z axis. The net torque will be 

= {al - al)rdA = ^ (6.3.21) 

where is the z*'* component of angular momentum. If we once again re-scale the vol- 
ume, r and dA together get a factor of A^, whereas gets a factor of A^. So, 

{al - al)rdA = A ^ (6.3.22) 
dt 

and therefore al — a\ = 0, or 

al = al (6.3.23) 

Continuing this with the other components, we see that cr] must be a symmetric tensor, 
giving it a total of 6 independent components. 

6.3.2 The Energy-Momentum Tensor 

We now want to generalize from the purely spatial stress tensor, which was a 3 x 3 
tensor, to a more general form that incorporates time as well as space. To see how to do 
this, consider the meaning of the cr] component of the stress tensor: 



dF 
~dA 

or 



a] = — (6.3.24) 



dF' = ain^dA (6.3.25) 



where dA is the area element normal to the the n, the unit vector in the direction. We 
want to generalize from the purely spatial stress tensor to a spacetime tensor, which means 
that we want to incorporate time. 
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This means that if we start with a spatial unit vector n, the infinitesimal element normal 
to it will no longer merely be an area element, but will also have a time component. In 
other words, it will no longer be dA = ndA, but now will instead be 

dS = ndS = ndAdt (6.3.26) 

where dS is a spacetime interval. 

Newton's laws make this generalization very easy. Recall that the definition of force 

is 

F = ^ (6.3.27) 



This suggests that we should generalize (6.3.251 as 



dF' = ajn^dA — > dp' = ajn^dAdt = ajn^dS (6.3.28) 



Because we assumed that n is a purely spatial vector, each of the indices here are spatial. 
We can interpret the component a* very similarly to how we interpreted the components 
of the stress tensor. Rather than cr* being i^^ component of the force on an area element 
normal to the j*'' direction, we instead take a* as the i direction /Zux of the momentum 
through an area element normal to the j*^ direction,^ As far as a* with spatial indices, this 
definition is identical to the definition of the stress tensor in the previous section. We are 
merely emphasizing "momentum per time" rather than "force". 

But because we are working with spacetime, we must allow for the possibility that n 
be a time-like vector as well. However, the infinitesimal element normal to a time-like n 
will have no time-component - it is purely spatial. Therefore we have in this case 

dS = ndS = ndV (6.3.29) 



where dV is a standard spatial volume element. The form of ( 6.3.28[ > suggests that we 
generalize this as 

dp^ = a^-n^dV = a°ydV (6.3.30) 

where the 0*'^ component represents the time component as usual. But we know from 
relativity that time component of the momentum four vector is energy (hence "energy- 
momentum four vector"). We therefore identify p° with energy, making ctq the energy 
density per unit volume. 

So, we know what (Jq represents, and we know what a* for i,j = 1, 2, 3 represents. 
What about ctq? We know that ctq will correspond to a time-like normal vector, and there- 
fore the relevant spacetime element will be purely spatial (dV). So, this corresponds to 

dp' = a^dV (6.3.31) 

So, 

(6.3.32) 



^Where "flux" relates to the momentum flow per time. 
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is the i^^ component of the momentum density. 

On the other hand, the component cr° can be interpreted from 

rfpO _ ^o^i^g (6.3.33) 
where i is a spatial index. The element normal to a spatial n is dAdt, so this is 

dp^ = a^dAdt (6.3.34) 

from which we can see that 

is the energy flux per unit time through the area element dA. 

This 4x4 tensor is called the Energy-Momentum Tensor, and it will play a very large 
role in general relativity. We have used the notation a* , but the more common notation is 
T/f where we are using the greek indices because they run over all spacetime values. 

Because we are talking about energy and momentum, you may guess that there must 
be some sort of conservation law involved. To see how this arises, consider the expression 



^dV J \dV 

= (6.3.36) 

where the last equality holds by conservation of energy-momentum d^p'^ = 0. Next we 
consider (9^T^^: 



dAdt 

= (6.3.37) 

In other words, the energy-momentum tensor provides four independently conserved 
currents, with conservation laws given by 

d^T!: = (6.3.38) 

We have already discussed the meaning of each component of the energy-momentum 
tensor. We now comment on the quantity T^^n'^ for unit four- vector n'^. This can easily be 
seen by the general expression (6.3.28): 

dp^" = Tl^rfdS (6.3.39) 
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The expression Tj^n'^ is simply the energy-momentum flux through the spacetime interval 
dS normal to n^. But now consider using the Minkowski metric rf^i, to lower the flrst index 
of Ti^ giving 

T^. = V^T: (6.3.40) 
Now consider the expression Tf^j.n": 



dS 



dS 

(6.3.41) 



This is, as we have already discussed, simply the energy-momentum flux through the 
inflnitesimal spacetime volume dS (with the spacetime index lowered by the metric), or 
in other words the spacetime density of the energy-momentum. Now consider taking the 
dot product of (|6.3.41[> with n^: 



dS dS 



= —{p^n^) {63 Al) 



This is the spacetime density of the component of the energy-momentum four vector in 
the direction of n^'. So what does this mean? 

As we said above, T^yTi" is the spacetime density of the energy-momentum four vec- 
tor of the material described by Tl' in the spacetime interval normal to n^. If we take 
n^^ to be time-like, we can interpret it as representing the motion of a particular inertial 
observer relative to the material described by T^^. Then, the spacetime interval normal 
to will be an infinitesimal box of the material as observed by the observer. The 
energy-momentum density of this infinitesimal box will be given by the four vector 

T,.n^ = ^ (6.3.43) 

The component of the energy momentum tensor in the direction of can then naturally 
be interpreted as the energy density of the material as observed by the n^^ observer. In other 
words, 

^n'^ (6.3.44) 
db 

is the energy density of the material as measured by the n^^ observer. And therefore, we 
can take 

T^^n^n^ (6.3.45) 

to be the energy density of the material as observed by the observer. 

To illustrate this, consider letting n^' be a unit vector in the time dimension, 

71^"= {I 0) (6.3.46) 
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This represents an observer with no spatial translation relative to the material described 
by - in other words an observer in the rest frame of the material. This will give 

T^.n'^n^ = Too = (6.3.47) 

which is (minus) the energy density - exactly what we would expect. 

This generalizes nicely to an arbitrary tnertial observer whose motion is described by 
the four- vector v'^'. The expression 

Tf.^v^'v" = energy (6.3.48) 

the energy density of the material as measured by an inertial observer with motion t>^ 
relative to the material. 



As a brief preview of where this is going, recall from sections 4.4.7 and 4.4.8 that given 
a manifold Ai and some vector on that manifold, we can consider the subspace of 
that is normal to v'^'. If we have a metric on Ai, we can form the Einstein tensor Gfj,^. The 
scalar curvature of the submanifold normal to is then given by 

Rsubmanifoid = G^yV^v" = curvature (6.3.49) 

Einstein's general theory of relativity essentially says that energy = curvature. In other 
words, for some observer with motion described by the time-like four vector t>^', the sub- 
space normal to that observer has curvature equal to the energy density. Einstein there- 
fore selE] 

oc (6.3.50) 

So, inserting a proportionality constant k and contracting both sides with some vector w'^, 
we have 

G^uV^v" = kT^^v^v" =^ {curvature) = ^{energy) (6.3.51) 

We will discuss this in much more depth below. We mention it now merely to indicate 
how this will show up in physics. 

6.3.3 The Relativistic Formulation 

In the previous two sections we derived the energy-momentum tensor using a fairly 
intuitive approach so that the meaning of each component could be seen and understood. 
However, recall from any introductory lecture covering Noether's theorem that both mo- 
mentum and energy are conserved currents arising from symmetries in a Lagrangian. 
Specifically, energy is a conserved quantity resulting from invariance under time transla- 
tions, while momentum is a conserved quantity resulting from invariance under spatial 
translations. So if we consider a relativistic Lagrangian that is symmetric under spacetime 
translations, we should arrive at a set of conserved quantities that match the energy- 
momentum tensor from the previous sections. We now set out to do this. 



^cf section 4.5.6 
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Consider some arbitrary Lagrangian density that is a functional of the field 0. The 
action is given by 



S = j (TxCicp, d^(, 



(6.3.52) 



The equations of motion are then given by the relativistic field version of the Euler- 
Lagrange equation: 



^ , dC \ dC 



Now consider a spacetime translation 

— > + 

This will induce the transformation 

Cix^" 



But at the same time, ( |6.3.54[ ) induces 



(xn + a^dMx^) 



So, under ( |6.3.54| > we have 



(50 = a'^di, 



These transformations will then induce 



£(0,9^0) £(0 + 50,9^0 + 5(9^0)) 

= £(0,9^0) + 50 +(9^50). 



90 ^ "'9(9^0) 



and if the Euler-Lagrange equation ( 6.3.53[ > is satisfied, this is 

dC 



£(0, 9/, 



£(0, 9^0) + 509/, ^ 

9£ 



9£ 



5<p 



So, according to (6.3.551, the transformation (6.3.541 induces 



5£ = a^9^£ 



But according to ( 6.3.59[ > it also induces 



(6.3.53) 



(6.3.54) 



(6.3.55) 



(6.3.56) 
(6.3.57) 



(6.3.58) 



(6.3.59) 



(6.3.60) 



(6.3.61) 
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Equating these we get 



dC 



dC 



a^d^cl) - a^Cj = 
a''d^(p-ril^c]a'' = 



(6.3.62) 



(where i]^ is the metric with one index raised). We identify the quantity in the large 
parentheses the Energy-Momentum Tensor, 



which clearly satisfies (from ( 6.3.62[ >) 



(6.3.63) 



(6.3.64) 



meaning that it is a conserved quantity. The interpretation of each component is identical 
to the interpretation in the previous section. 

To see this consider the Lagrangian with two of the spatial dimensions suppressed (for 
simplicity). 



So, 



rp[] 

J- n 



m 

2 



2 J,2 



(6.3.65) 



i2 



-^(-0^ + 0") 



1^2 1^/2 ^ 1^2^2 

2^ 2^ 2 ^ 



2 i2 

-m (p 



(6.3.66) 



which you can recognize as the Hamiltonian density, or total energy density, of the sys- 
tem - exactly what we would expect from Tq You can furthermore work out the other 
components of using (6.3.651 to see that they have the same meaning in terms of the 
energy and momentum. 

6.3.4 The Modern Formulation 



As a final comment, we want to take a closer look at the energy-momentum tensor 
and the action. To begin with, consider a Lagrangian 



C = C{(f), df,(, 



(6.3.67) 



^If you don't understand why this is the Hamiltonian, reread the first chapter of lllOl - you can work out 
the Hamiltonian from ( 6.3.65| using the Legendre transformation discussed there. 
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The general form such a Lagrangian will take will be 

1 



yd,<j)d,<j) - V{<j)) 



(6.3.68) 



So to build the energy-momentum tensor from this involves first calculating (from 

dC 



(6.3.63» 



did. 



Then 



dC 



So we have the general form 



9,0 = -5^09,0 



(6.3.69) 



(6.3.70) 



(6.3.71) 



Now, recall that the action involves an integral over all spacetime. The primary idea 
of general relativity is that the spacetime metric becomes a dynamical field (along with 
all physical fields like 0, etc.). Therefore the geometry of the spacetime manifold is is 
unspecified a priori. For this reason we don't merely integrate C over spacetime as 



S = dxdydzdt C 



(6.3.72) 



Instead we integrate over the invariant volume element (cf section 4.3.2 ), 



S 



dxdydzdt y\g\jC 



We therefore define a "new" Lagrangian density =Sf defined by 
Now consider the expression 



(6.3.73) 



(6.3.74) 



dg 



QgflU 

dC d\/\g\ 



\9\ 



' dg>^'' dg''" 

The derivative in the first term can be calculated easily from ( 6.3.68[ ): 

dC 1 d 



(6.3.75) 



dg 



fii/ 



2dgt"' 



(^?°''5,09/30 + 21^(0)) 



(6.3.76) 
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Now we need to calculate the derivative in the second term in (16.3.751). This is 



d^/\g\ 



{\9\) 



1/2 



d 



Now we make use of a general relation from linear algebra]^ 



So 



(6.3.77) 



(6.3.78) 



(6.3.79) 



Now we can plug these derivatives into (6.3.75 1, getting 



dg 



2 
1 



-d„ 



(6.3.80) 



where we used equation (6.3.71 1 in the last line. This allows us to write 

2 



T, 



\9 



dg 



(6.3.81) 



It turns out that the energy-momentum tensor as defined in (6.3.81) holds in any arbi- 
trary geometry and for arbitrary fields]^ We will therefore take it to be the most funda- 
mental defining statement of the energy-momentum tensor. We will find this expression 
extremely helpful throughout this notes as well as the others in the series. 

6.3.5 The Meaning of the Modern Formulation 



The derivation of (6.3.81 1 above is admittedly lacking in rigor. We "derived" it only by 
showing that it leads to the same result in flat space as the Noether's theorem approach in 
the preceding section /or scalar fields only. We will therefore spend a bit of time discussing 
it in more detail now. We will first derive it in a different way, then discuss what the 
implications of each approach is. 



^This can be found in any introductory text on linear algebra in the chapter on determinants. 

^Though we only did this calculation using scalar fields (p, it is valid with any set of physical fields and 



with any potential. 
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The fundamental idea behind general relativity will be that we take the spacetime 
metric to be a physical and dynamic field, in addition to the physical fields. So, instead of 
the Lagrangiarj^ 



(where (p^ is a vector field for generality), we have the Lagrangian 



(6.3.82) 



(6.3.83) 



where g^u is the spacetime metric. Consider a change of coordinates from, say, x^^ to y^. 
This transformation matrix for this will be 



(6.3.84) 



where the determinant of -R(^ is denoted 

det{Ri^) = R (6.3.85) 
Now consider again the Lagrangian with the addition of the invariant volume term, 

^ = (6.3.86) 



Under the transformation (6.3.841 the invariant volume term will transform according to 




So, under ( |6.3.84| > we have 



— > R^ 



(6.3.87) 



(6.3.88) 



On the other hand we can look at this transformation from another perspective. The 
functional dependence of =Sf is indicated in ( 6.3.83[ >: 



(6.3.89) 



^We are using all covariant indices for simplicity and for later convenience. 
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We can apply the same transformation (6.3.841 to each of these. For 0^ and g^^iy this is easy 
(using the bar to indicate the y coordinates): 



_dx''dx'' _ DaD/3„ 



dyfj- dy^ 



(6.3.90) 



With dn(t)v we must be more careful - we must do what we did in (4.2.2 )): 



d , 



dx°' d \ dx"' 



dx" dx'^ dx"" d'^x"' 



dyf 


dy" 


dx"' 


dx'^ 


dyf^ 


dy" 


dx" 


dx"' 


dyt" 


dy" 



da(f)-y + 



d 



+ 



Qy^^ dx^dy" ^ 
dx" dy^ d'^x"' 
dy^" dx" dy'^dy" 

d^x"' , 



R^Rld^ct)^ + 



dy^^dy-^'^ 



dy^^dy^ 



(6.3.91) 

Writing the second expression in terms of is a bit trickier because it can be written as 

„ d'^x"' 



B W = = d H"' 

dy^dy- " dy-dyf^ " 

So, to make sure we are as general as possible, we write it as 

= R''^Rlda(j)^ + ^{d^R2 + d,Rl)<p. 
So, combining these results we have 

4).. = Rf.ct). 



(6.3.92) 



(6.3.93) 



d, 



R"^R'ld^<P, + -{d,R'l + d,Rl) 



R'^Ru9al3 



(6.3.94) 



Now consider making the transformation R^ to ^ = =Sf(0^, 9^01., (7^1.). Combining 

(6.3.95) 



( I6.3.94D and ( |6.3.88D we have 
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We now do something that will seem very strange, but will be very helpful. We take the 
derivative of both sides of (6.3.95) with respect to the transformation R'^. Doing this will 
require the result from linear algebra (6.3.78), so the right side will be 

d 



dR[ 



(6.3.96) 



(the ^ part has no dependence on 



The derivative of the left hand side of (6.3.95) can then be expanded using the chain 



rule: 



d 



+ 



' dR'^d^^ dRP d{d^^,) ORP d{g^,j 
But from ( 6.3.94[ ) we can find each of the derivatives of the fields in this expression: 

dd),, d 



(6.3.97) 



dR'. 



w 

d 



dR'. 



dR", 



dRi 



[R-Rld^ct^^ + -{d,RZ + d,Rl) 



R';:6Xda<l>, + s'^s;R:d^<j), 

d 

R'^X^tgaP + W^^gaP 

KR'^dap + 6'^R'^gpi3 
{5:R'^^ + 6;R':)g^, 



Now we can plug these into ( 63.97) to get 



d 



OR'. 



(^) 



{R~X^^ 



(6.3.98) 



(6.3.99) 



where we used ( 6.3.96| ) in the last line. 

Now let's take the particular case where = x^, or the identity. This makes 



St: 



(6.3.100) 



and 



R 



(6.3.101) 
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So, we now have 



+ 5, 



dig, 



fii/ ) 



B££ B£e 



'^9pp 



B^ 



dgap 



dg, 



(6.3.102) 



We can then use the Euler-Lagrange equation on the first term, followed by an integration 
by parts: 



B^ 



B^ 



-{dp<pp, 



. B^ 



d{B^(t)„) 



(6.3.103) 



But this will cancel with the second term above, leaving 



B^ B^ 
+ 2- 



-PI dgi 

Rearranging this and writing it in terms of £ rather than ^ , this is 



(6.3.104) 



B^ 



BC 



Bg 



pa 



Or, finally. 



1^1 



2 B^ 



9(9-0, 



-(9p0,) + ^J\g\6„pC 



(6.3.105) 



BC 



\g\ ^dpa 



9(9-0,; 

Comparing this with (6.3.63 1, we can identif}|^ 

2 9^ 



■(9p0,) - 6apC 



(6.3.106) 



Bg 



pa 



T, 



pa 



(6.3.107) 



just as in (6.3.81). 

So what is going on here? The energy-momentum tensor is essentially a measurement 
of the flow of energy and momentum (which viewed relativistically are part of the same 
4-vector) through spacetime, and we can therefore think of it as a conserved Noether 
current. The generators of this conserved current are the invariance of C (or =Sf ) under 
spacetime translations. However, a spacetime translation is nothing more than a change 

^"There is the small difference of the delta function here and the metric there, but had we been more 
rigorous they would have been the same. We are merely trying to communicate the essential idea, not the 
details. 
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of coordinates. So, by imposing (6.3.951 we are essentially imposing the invariance of the 
Lagrangian under spacetime translations. 

However, we can also think of a spacetime translation as a change in the metric. Con- 
sider such a change, — > g^^ + Sg^u- This will result in 



■^{9tiu + Sg 
■^{9tiu) + Sg^,^- h 



dg, 



flU 



1 



(6.3.108) 



So, if you change the spacetime, you change the physics (obviously). And to first order, 
the change in the physics is given by the energy-momentum tensor. This is perhaps the 
most fundamental meaning of T^^,, and stems from the derivation of both the relativistic 



derivation of section 6.3.3 and the derivation in this section. It is essentially a statement 



of the "reaction" of the matter fields to a change in the coordinates (cf the Noether current 
derivation above), which is essentially a local change in the metric. 



6.4 General Relativity 

Introductions to general relativity typically approach the subject by one (or both) of 
two roads. The first is the axiomatic approach, which is the approach Einstein originally 
followed. This involves taking the physical assumptions upon which pre-general relativ- 
ity physics is built and showing how they led Einstein to postulate the theory. The second 
road, while less intuitive, is more fundamental. It begins, like most any other physical 
theory, with an action. Both roads are very useful for different reasons. We will therefore 
consider both, beginning with the axiomatic approach. This will allow us to write out 
the complete field equations for general relativity, thereby specifying the complete the- 
ory. Then, before looking at the action principle, we will look at the meaning of Einstein's 
fields equations, as well as some properties, consequences, and various solutions. The we 
will conclude with the action/Lagrangian of general relativity. 

6.4.1 The Equivalence Principle 

To begin our exposition of this idea, we will begin with an idea from introductory 
physics. Imagine an observer, S, "in space" (the absence of any force fields like gravi- 
tational or electromagnetic), in an inertial frame. Then, imagine a test object in the same 
frame as the observer. Obviously the test object will be stationary relative to this observer, 
and therefore Newton's second law for this system is 

Fnet = (6.4.1) 

If the test object is experiencing an applied force iappUed, this would be modified as 

Fnet = ^applied = ma (6.4.2) 
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Now, consider another observer, S', accelerating relative to the first observer]^ We 
will take their axes to line up at t = 0, and the acceleration to be in the z direction with 
acceleration a^. Now, because S' is accelerating relative to the frame of S, S' is also accel- 
erating relative to the test object. However, relative to S', the test object is accelerating in 
the negative 2;-direction. 



m 



s- 



Relative to S", the the test object is accelerating. Therefore, according to Newton's second 
law, the test object has a non-zero net force. Because in classical mechanics these aren't 
actual forces but rather are a result of the frame of S', they are called Fictional Forces, 
or sometimes Pseudo-Forces. However, let's take them seriously and write out New- 
ton's law for the object in this frame. It is accelerating downward (in the z direction) with 
acceleration a^, it has mass m, and so 

F„et = -ma-^e^ = ma (6.4.3) 

^^Of course there is a fundamental difference in these two observers in that S* is in an inertial frame 
whereas 5" is not. Therefore special relativity doesn't apply to S'. Observer 5* will not be able to tell the 
difference between his frame and any other inertial frame, but S' will be able to "feel" the acceleration of 
his frame and will therefore know that he is accelerating relative to S. 
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where is a unit vector in the positive z direction. If the object is experiencing an applied 
force ^applied, this would be modified as 



Fne£ = -maze^ + i applied = ma. 



(6.4.4) 



We could rewrite (6.4.41 as 



where 



Fl 



net 



'^applied 



net 



(6.4.5) 



(6.4.6) 



Note that ( 6.4.5[ > has exactly the same form as (6.4.2). In other words, if we are working 
with the "net force equals mass times acceleration" form of Newton's second law, we can 
generalize to non-inertial frames by replacing "net force" term on the left hand side with 
the fictional force terms. The right hand side of the equation is then the same as in the 
inertial frame case. 

Now consider what will appear to be a very different situation. Imagine an observer 
sitting on a planet, and therefore in the presence of a gravitational field. We choose the 
observer's axes so that gravity is pointing in the negative z direction with gravitational 
acceleration to be If there are no other forces acting on the object, Newton's second 
law gives 



Fnet = -mge^ = ma 
If there are any other forces acting on the object (iappUed), this will be 

Fnet = -mge^ + f applied = ma 
We can take the gravitational force term to the left hand side, getting 



F' 



net 



'■applied 



where 



Ket = Fnet + mge. 



(6.4.7) 



(6.4.8) 



(6.4.9) 



(6.4.10) 



Now compare F'^^^ in ( |6.4.10[ ) to F^^^ in ( |6.4.6| >: 



( I6.4.6D 
(I6.4.10D 



f: 



net 



net 



ma,e. 



F' 



net 



Fnet + mge^ 



(6.4.11) 



If we take the acceleration due to gravity to be 5^ = a^, we see a remarkable coincidence 
- the inertial mass in (6.4.6) is equal to the gravitational mass in (6.4.10)1 Of course this co- 
incidence may not seem particularly profound. But we will see that, in fact, it is this 
coincidence that forms almost the entire foundation of general relativity. In fact, it is so 



^^On the surface of the Earth this would he g = 9.8 m/s^. 
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important that it has its own name - The Equivalence Principle. The equivalence princi- 
ple can actually be stated several ways. This is simply the first. 

Now consider an observer in a box with no windows and no means of observing 
anything outside of the box. The observers can feel that he is accelerating towards the 
"floor" of the box[^ He happens to have a test object of known mass m with him (he 
carries it around with him in case he ends up inside a box like this one). He holds it in 
front of him, lets go and of course it accelerates towards the ground. He measures the 
acceleration of the object to be a. He concludes that Newton's law for this object is 



net 



-maez (6.4.12) 



where once again is simply a unit vector pointing towards the "ceiling". Then, being 
somewhat familiar with Newton's laws in non-inertial frames, he realizes that there is an 
ambiguity in his situation. Specifically, he wonders of he is in a box out "in space" that is 
accelerating at a (with no other force fields around), or if he is in a box sitting still in the 
presence of a gravitational field with gravitational acceleration a. After a few minutes of 
pondering he realizes a somewhat frustrating consequence of the equivalence principle. If 
there was a slight difference in the inertial mass and the gravitational mass of the particle, 
perhaps he could devise some experiment to see which mass was coming into play in his 
box (assuming he knew the mass(s) of the object before getting into the box). However, 
because gravitational mass equals inertial mass, he realizes despairingly, it is impossible 
for him to tell the difference. Until he can get out of the box and look, there is no difference 
from his point of view between sitting still in a gravitational field and accelerating in the absence 
of a gravitational field. 

Now consider another observer in the same type of box. He doesn't feel any acceler- 
ation. He also carries a test mass with him, so he holds it in front of him, let's go, and of 
course it remains floating in front of him - no motion at all (relative to him). He initially 
concludes that he must be out in space (away from any force fields), and in an inertial 
frame. Being familiar with special relativity, he despairs that because he can't see out of 
the box, he doesn't know anything about his motion other than the fact that it is inertial. 
After a moments thought, however, he realizes that the situation is much worse. If there 
is an observer in a gravitational field with gravitational acceleration g, but the observer is 
in "free fall" in this field, then the observer's acceleration will obviously be g. This means 
that, in the frame of the planet causing the gravitational field, the net force on the test 
mass is 

Fne4 = -m^e^ (6.4.13) 
But, because the frame of this observer is accelerating at g as well, the fictional force term 



will be rngBz. So, equation (6.4.9) becomes 

F'net = Fnrf + -mge^ = -mge, + mge^ = (6.4.14) 
And therefore equation ( 6.4.8[ > is 

^applied = (6.4.15) 

^^Of course he calls that particular wall of the box the "floor" merely because that is where he is acceler- 
ating. 
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So, this observer realizes, not only does he not know what inertial frame he is in, but also 
he doesn't even know if he is in an inertial frame. He could be in free fall the gravitational 
field of a planetj^ 

These considerations bring us to another statement of the equivalence principle. 
Specifically, It is impossible to distinguish between acceleration in the absence of gravitation from 
remaining stationary in the presence of gravitation. 

Of course, the observer in the second box realizes that if the box were very large, 
and he had two test masses, he may be able to determine which situation he is in. The 
following diagram illustrates how: 




The planet (the circle) pulls both the box and the test masses towards its center. If the box 
is big enough, the observer in the box would be able to drop the two test masses and see 
if they move towards each other as they fall. If the box is not sufficiently large, however, 
this is not possible. 

So, in light of this realization, we modify our statement of the equivalence principle 
above: It is impossible to distinguish between acceleration in the absence of gravitation from 
remaining stationary in the presence of gravitation, in a sufficiently small region of spacetime. 

The equivalence principle we have discussed so far is really nothing more than a co- 



incidence of Newtonian theory. It was the similarity of equations (6.4.6) and (6.4.101 that 
lead us (and the observers in the boxes) to realize the equivalence of various scenarios. 
Of course, if it turned out that gravitational mass, rrig, is not equal to inertial mass, rrii, 
Newtonian theory would have no problem. We would simply have two different mass 
terms in ( |6.4.6| > and ( |6.4.10| >. However, countless tests have been done to test this fact, and 
to date there is no evidence that there is a difference. In other words, as far as we can tell 



^^This is not only frustrating because of a fundamental scientific and epistemological limitation, but also 
because he might be about to slam into a planet. 
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experimentally. 



rrig = rrii (6.4.16) 

It was this that lead Einstein to make the underlying assumption that led to his general 
theory. Rather than take the equality of nig and rrii as an interesting coincidence of New- 
tonian theory he decided to make it a fundamental postulate. 

The reason this is so important is that we can reason from what we know of the equiv- 
alence principle so far to yet another statement of it. Recall from introductory physics 
that the force of a planet of mass M on a test mass of mass m is 

Mm 

Fgrav = G— (6.4.17) 



Setting up Newton's second law. 



= ma (6.4.18) 



The m on the right side of this equation is the inertial mass - whereas the mass on the 
left side is the gravitational mass. But because of the equivalence principle we know that 
they are equal, and therefore m divides out of the equation. 

M 

a = G^ (6.4.19) 

So, the acceleration is independent of the mass of the object. This leads to the alterna- 
tive statement of the equivalence principle: The motion of an object in a gravitational field is 
independent of its mass or composition. 

This statement, however, has extraordinarily profound implications. This is because, 
if the motion of an object in a gravitational field is independent of its mass or composition. 
It is impossible to shield any object from the effects of gravitation - in other words, gravity couples 
to everything. 

To see why this is so important, consider electromagnetism. How can the strength of 
the electromagnetic field be measured? The simplest way is simply to take two particles 
of known mass and composition - one that is uncharged and one that has a known charge. 
The uncharged particle will not interact with the electromagnetic field and we can there- 
fore take its motion to be "inertial". It is then sensical and well defined to measure the 
acceleration of the charged particle relative to the uncharged particle, and in doing so we 
know the force on the charged particle and through the equations of electromagnetism 
the strength of the electromagnetic field can be determined. 

However we don't have this luxury with gravity. There is no equivalence principle for 
electromagnetism. We cannot say that the motion of an object in an electromagnetic field 
is independent of its composition - it depends very much on the charge. We cannot say 
that the electromagnetic field couples to everything - it doesn't couple to things that don't 
carry electric charge. An observer trapped in a box can tell the difference between inertial 
motion outside of an electromagnetic field and "free fall" inside an electromagnetic field 
by observing the motion of objects with different composition and charge. 
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But with gravity, not only is the motion of the test object completely independent of 
the composition, but also it is completely impossible to shield anything from gravity to 
act as an "inertial" object. In other words, whereas with electromagnetism an uncharged 
object can act as the inertial object from which to compare the acceleration of the charged 
object, with gravity we have no such inertial objects because everything is affected by 
gravity. 

So, if everything is affected by gravity, and if the motion of everything is the same re- 
gardless of its composition, it becomes possible to set up a set of "preferred" world lines, 
or curves in spacetime, along which any arbitrary object will travel (and are therefore 
geodesic lines). And because these world lines don't depend at all on the objects travel- 
ing through spacetime, a natural conclusion is that the world lines don't depend on the 
objects, but instead depend on the spacetime itself. And finally, this leads to the radical 
conclusion that the influence of gravity is not due to an field in spacetime as in electromag- 
netism, but rather is due entirely to properties of the spacetime. So, we have elevated 
spacetime from merely the background in which physics happens to a dynamical player 
in physics. Specifically, The spacetime metric is not flat as in special relativity (•q^J. Rather, the 
motion of particles in a gravitational field are the geodesies on a curved spacetime. 

This leap, from the assumption that gravity is a field like the electromagnetic field, to 
the assumption that in fact there is no gravitational field, but rather gravity is the effect 
of the curvature of spacetime, is a direct consequence of the equivalence principle, the 
fundamental basis of general relativity, and along with quantum mechanics one of the 
most radical paradigm shifts in the whole of science. 

Recall that in special relativity a key idea is that of an inertial frame. It is not possible 
(in special relativity) to uniquely choose an inertial frame that is "at rest" - in fact there is 
no notion of absolute rest or absolute motion. However (in special relativity), it is possi- 
ble to single out frames that are accelerating - there is a notion of absolute acceleration. 
This is what allowed us to find the strength of the electromagnetic field - we can tell the 
difference between non-accelerating and accelerating. But, with our new understanding 
of gravity, nothing can be shielded from gravity, and therefore there is no way of giving a 
meaningful definition of "the acceleration due to gravity". 

However, this doesn't mean that we must give up a notion of a preferred collection 
of reference frames. Recall above that the observer in the box that is in free fall in the 
gravitational field will experience no acceleration, and there will be zero net force on an 
object in this frame. This leads to the notion that we should define "non-accelerating" as 
"freely falling in a gravitational field". 

Of course, this definition appears at first to be backwards. We are saying that an object 
sitting still on the surface of the Earth is actually accelerating, whereas an object that is 
accelerating down towards the Earth at 9.8 m/ is not actually accelerating. However it 
is actually the most sensible definition. Recall that in our new understanding, gravity is 
not a "force" but rather represents a deviation from a flat metric. So, consider tine object 
sitting still on the surface of the Earth. Free fall would dictate that the geodesic it "wants" 
to follow is the one that moves it through spacetime towards the Earth's center. However, 
because of the normal force from the surface of the earth, the object doesn't follow this 
geodesic. Because in our new paradigm, gravity is not a force, the only true force acting 
on the object is the normal force of the ground. Therefore, with this single force, the object 
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is actually accelerating away from the center of the Earth! The fact that, when an observer 
is standing still on the surface of the earth, he feels the force of the earth on his feet is 
further evidence of this. On the other hand, the observer freely falling in the gravitational 
field doesn't "feel" any acceleration So, our designation of free call as non-accelerating 
and stationary as accelerating is actually the most sensible way to think! 



To emphasize this, recall from section 4.2.5 ff that a geodesic is the path a particle will 



follow in the absence of any other forces. Any deviation from a geodesic is caused by 
an external force. So, we can take an object with no force to be an object following a 
geodesic. As we said, an object following a geodesic will freely fall and "accelerate" (in 
the Newtonian sense) towards the center of the Earth. Because it is following a geodesic, 
it isn't experiencing an external force, and therefore is "non-accelerating". However, the 
object sitting still on the surface of the earth is deviating from a geodesic, and therefore is 
experiencing an external force, and therefore is accelerating. 

As a brief comment, the previous considerations allow us to make a final observation. 
Consider an observer in free fall in a gravitational field. This observer is by definition 
not accelerating, but rather is merely following a geodesic without any external forces. 
But according to the equivalence principle, this is absolutely identical to being in an in- 
ertial frame in the absence of any gravitational field. So, we have the conclusion that, for 
any gravitational field, there is a reference frame (namely, the free fall frame) in which 
the effects of gravity are completely non-existant. For an observer in a free fall frame, there 
essentially is no gravity whatsoever, and there is no loca|^ experiment he can do to detect 
any gravity. So, in such a frame gravity can be completely ignored (locally). 

However, other forces like electromagnetism don't have an equivalence principle, and 
therefore there does not exist a privileged frame frame where the electromagnetic field 
can be ignored. It is always possible to choose an object that is uncharged, or with a 
different charge, to compare one's motion to. This difference between gravity and all 
other forces is a particularly important quirk of the equivalence principle. We will see that 
it has radical physical consequences when we attempt to create a geometrical theory of 
non gravitational forces. The primary difference, as we will see, between gravity and the 
other forces is that gravity is a theory of the geometry "of" spacetime, whereas the other 
forces are theories of the geometry "in" spacetime. Another way of putting this is that 
gravity involves altering the spacetime metric and consequently the metric connection 
is altered. In this sense general relativity is a theory of the Levi-Civita Connection, (cf 



section 4.2.71. The other forces, on the other hand, are theories where the spacetime metric 
is unchanged, but the connection takes on a form other than the Levi-Civita connection - 
the fields create a non-metric connection on top of the metric connection. To make better 



sense of this, consider the rather strange example of the connection in section 4.2.4 with 



geodesies graphed starting on page 199 Defining the connection required us to use the 
standard Euclidian metric on M^, but we were choosing a connection unrelated to the 
Euclidian metric. And as a result, despite the space being flat (according to the metric). 



^^Consider the Vomit Comet. 

^^Of course, he could look around to see if there is a planet nearby, or if he had enough space he could 



set up an experiment like in the picture on page 327 By "local" we mean that he cannot do these types of 



experiments that require looking beyond his immediate surroundings - he's trapped in a small box like the 
observers discussed above. 
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the geodesies were not the usual straight lines we'd expect in flat M?. The meaning of 
this (which we will spend the next paper discussing in much, much greater detail) is 
that the connection we chose, which was not the metric connection, acted as a "fleld" 
which changed the geodesies the particle was traveling under. In other words, there are 
some particles that may "see" that connection and others that wouldn't. Any particle that 
could see the connection would follow those geodesies. Any particle that couldn't see 
that connection would follow the usual flat space straight line geodesies. However, if the 
space wasn't flat - i.e. if we changed the metric of the space from Euclidian to non-flat, 
it wouldn't be possible for any particle to be oblivious, and all particles would follow 
the non-straight line geodesies. Adding a field of some sort to the non-flat space would 
simply cause even more exotic geodesies. It is this that outlines the difference between 
gravity and the other forces. All other forces create an additional connection in addition 
to the spacetime metric connection, and only those particles that can see that additional 
connection will alter their geodesies accordinglyp^ Those particles that cannot see the 
additional connection will ignore it and follow the geodesic any particle would follow 
were the additional connection not there. On the other hand, because gravity represents a 
deviation in the connection of the spacetime metric, any particle that is in spacetime (which 
is, of course, all particles) will "see" it and therefore the geodesies of all particles will 
be affected. This is the essential content of the equivalence principle - nothing can be 
shielded from gravity because gravity affects spacetime itself. 

All of this brings us to our first mathematical result of this section. We are claiming 
that gravity is the result of deviations from a flat spacetime metric. And therefore, the 
absence of "gravity" is actually the absence of curvature. So, in the absence of curva- 
ture the spacetime metric is flat. And therefore the geodesies are "straight lines" through 
spacetime - corresponding to solutions to 



(cf section 4.2.5 where the connection coefficients vanish because the metric is flat, cf 



equation (4.2.1141). However, in the presence of spacetime curvature, the metric is not 



flat, and according to ( 4.2.69[ >, the geodesic equations will be 



+ r°-^-^ = (6.4.21) 



rfr2 dr dr 

The solutions to this will obviously not be straight lines for F^^, 7^ 0. 

As a brief comment before moving on, consider what we have done from a more 
mathematical point of view. In special relativity the underlying theme was all inertial 
frames are equivalent^ Mathematically this was stated as the idea that all observers re- 
lated by Lorentz transformations are equivalent. A Lorentz transformation is defined as a 
transformation leaving the metric r]^^ unchanged. In other words, the physical statement 
"all inertial observers are equivalent" is the same as the more mathematical statement 
"observers related by transformations preserving the metric". However, we are now al- 
lowing for transformations that don't preserver the metric - the whole point is that we are 



^^We will call "seeing" these connections "charge". 

^^This was called the Principle of Special Covariance, or as the Principle of Lorentz Covariance. 



331 



considering the metric to be dynamical, and the deviations from flat are interpreted as 
gravity! So, we are now talking about a more general set of observers - not merely the 
ones who see the same metric as in special relativity This leads to another perspective on 
general relativity - the Principle of General Covariance, which says that all observers are 
equivalent. 

6.4.2 Einstein's Field Equations 

This now raises the question - what causes the metric to curve. We have essentially 
said 



but what causes curvature? Finding the geodesic equations requires knowing V^^, and 
finding F^^^ requires knowing the metric. So, how can we find the metric, and what makes 
it curved? When will it be flat? In other words. 



Most introductory texts or notes on general relativity will, at this point, dive into a discus- 
sion of a idea called Mach's Principle. This is somewhat misleading, however, because 
while Mach's thinking initially inspired Einstein, Einstein ultimately rejected the central 
notions of Machian thought, and general relativity is not a truly Machian theory. Rather 
than discuss Mach's principle, we mention some of the features that survived in general 
relativity. 

In special relativity, the principle of special covariance stated that all inertial frames 
are equivalent. The meaning of this is that there is no such thing as absolute motion or 
absolute rest. No observer can be said to be truly and absolutely "at rest" or truly "in 
motion". Only relative velocities can be measured. However, in the context of special 
relativity, it is very easy to differentiate between inertial and non-inertial observers. This 
is precisely the limitation of special covariance - it is limited to inertial frames (frames 
related by transformations that preserve the metric 77^,^). Einstein's leap was to reject this 
distinction between inertial and non-inertial frames - he said that not only is it true that 
motion is relative, but also it is true that acceleration is relative. In other words, one 
cannot talk about absolute acceleration - only acceleration relative to something else. If one 
were to be in a universe with no other matter in it, acceleration wouldn't be possible (the 
idea of acceleration couldn't even be defined). However, because we do live in a universe 
with matter it it, we can talk about acceleration in a meaningful way relative to that matter. 
And because it is matter that allows us to talk about acceleration, Einstein guessed that 
it is specifically this matter that is the cause of acceleration! And from special relativity 
we know that matter is nothing more than energy {E = mc^), he made the guess that the 
underlying equation of general relativity should be 



gravity = curvature 



(6.4.22) 



gravity = curvature = ? 



(6.4.23) 



gravity = curvature = energy 



Or in mathematical terms (cf equation (6.3.51 )), 



kT, 



(6.4.24) 
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(where k is a proportionality constant to be determined later, G^^, is the Einstein tensor. 



and T^y is the energy-momentum tensor). Equations (6.4.24) are called the Einstein Field 



Equations. They are a set of differential equations for the metric (cf the end of section 



4.5.6 1 which explain how curvature is governed by energy/ mass. 

Notice that ( 6.4.24| > has the exact form we stated it would have in (6.3.51 1. 



The implications of equation (6.4.241 are profound. They state that, on the one hand. 



the motion of matter in the universe is determined by the geometry of the universe. On 
the other hand the geometry of the universe is determined by the distribution of matter 



in the universe. We will see the results of this in several examples of solutions to (6.4.241 
later in this chapter. 

6.4.3 The Correspondence Principle and Newtonian Gravity 



Einstein's general relativity, as encapsulated in equation (6.4.24), leads us to ask two 
questions. The first is simply what the value of k should be. The second is the slightly 
more complicated question of whether or not general relativity is consistent to Newtonian 
gravity in the appropriate limit. We treat these two questions together because answering 



one will answer the other. Namely, by showing that (6.4.241 is consistent with Newton's 
"one over r squared" law, we will be able to see what the value of k should be. 

We can rewrite ( 6.4.24[ ) in terms of the definition of the Einstein tensor (cf equation 
(|4A7T|), getting 



kT, 



(6.4.25) 



We can take a trace of both sides (contracting the indices with the metric) and, denoting 
the trace of the energy momentum tensor g^^T^u = T, 



R 



kT 



(6.4.26) 



Then, using (4.1.141 (along with the assumption that we are working with 4 dimensional 
spacetime) and (4.4.68 1, we have 



R--RA 
2 



kT 



(6.4.27) 



or 



R = -kT 



(6.4.28) 



We can plug this back into (6.4.24), which effectively swaps the role of R^u and T^^' 



(6.4.29) 



or 



(6.4.30) 



333 



Now consider a time-like unit vector f^. We can contract both sides of (6.4.301 with 
this vector. 



(6.4.31) 



On the right hand side, the first term is simply the energy density observed by someone 
in the frame (cf equation (6.3.48 1). We will take it to simply designate the mass density, 
which we will denote p. The second term on the right hand side is the dot product of 
with itself, which is 1 because we assumed is a unit vector (g^ytH" = t^t^ = t^ = 1). So, 
(I6.4.31D is 



(6.4.32) 



Then, we can choose our vector t^^ to correspond to the rest frame of the matter T^y de- 
scribes, meaning that the diagonal elements other than Tqo vanish. Then, because Too is 
the energy density in the rest frame of the matter. Too = Pr and therefore T = p. 



(6.4.33) 



or 



R^.•^tn'' = -np 



(6.4.34) 



Now consider that the matter density is a sphere of radius r and mass M with uniform 
mass density (so pV = M). The volume of this sphere will be 



V = -nr^ 
3 

We can take a first and second time derivative of this, getting 



(6.4.35) 



V 
V 



(6.4.36) 



Now, considering this in the context of what we did in section 4.4.4 equation (4.4.26 1 tells 
us that the second term here vanishes, leaving 



V = 4:7ir r 

Then, plugging ( 4.4.39[ > into ( 6.4.34[ > we have 

V 1 



V 



1 M 



(6.4.37) 



(6.4.38) 



or 



(6.4.39) 
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Or, the acceleration a = r is 

a = f ^ (6.4.40) 

Sir 

This is exactly the Newtonian expression for acceleration in a gravitational field if we set 

K = SttG (6.4.41) 

where G is Newton's gravitational constant. 

So, we have succeeded in not only showing that general relativity does indeed reduce 
to Newtonian gravity in the low curvature limit, but also finding the proportionality con- 
stant. So, the full Einstein field equations can finally be written as 

G^, = 8nGT^, (6.4.42) 

or if we work in "natural" units where G = 1, the more familiar form, 

G^, = SttT^, (6.4.43) 

6.4.4 The Function of Einstein's Field Equations 



There are several ways we can view (6.4.43). The first is "from right to left" - they are 



a set of equations to give you the geometry once the matter distribution (T^u) is known. 
In other words, in this view, we use whatever physical information we have to piece 
together the energy momentum tensor. Then, by setting G^u equal to whatever this turns 
out to be, we have a set of differential equations for g^^, which we can solve to completely 
determine the geometry. This approach, while nice in principle, is rarely practical. The 
Einstein tensor ends up being a highly non-linear expression of the derivatives of g^^, and 
it is almost never possible to solve it for g^^ directly. 

The second way to look at ( |6.4.43 > is "from left to right" - starting with some metric we 



compute G^,y and consequently know everything about the matter distribution by simply 
reading off T^^. However, this is also rarely useful because it is rare that an arbitrarily 
chosen metric will yield a physically meaningful energy momentum tensor. It is possible 
that this approach results in something useful or interesting, but that is not the norm. 
Finally, there is the way that has proven to be the most useful. Typically, it is not the 



case that we know either side of (6.4.43) completely but nothing at all about the other 
side. We usually know a little about the matter and a little about the geometry. This 
allows us to make a simultaneous choice of the general form of (^^ji, and T^i,, which is 
called an ansatz. Einstein's equation then provides a set of constraints between the two. 



The example we did in section 4.5.6 is a good example of this. We were able to guess 



the general form of the metric in ( 4.5. 50[ ), and the form of the energy-momentum tensor 



in equation in (4.5.521. We didn't have complete information about the metric or about 
the matter distribution, but by the ansatz we assumed we ended up with a much nicer 
differential equation which acted as a constraint on both sides. How this works physically 
should be more clear when we do examples shortly. 

So, the way we will approach problems in general relativity is to assume a form of both 
the metric and the matter distribution, and then plug everything in to get a differential 
equation. If our ansatz was well chosen, the solutions to the differential equation will 
give a good physically meaningful answer. 
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6.4.5 The Cosmological Constant and Additional Comments 



Before looking at actual solutions to (6.4.431 we make a few more comments about the 



general structure of Einstein's field equations. 



It turns out that there is a way that (6.4.431 could be modified. This is through the 



addition of a Cosmological Constant term, denoted A. This term is simply added to the 
field equation: 



G^^ + Ag^,u = R^,u - \r9^.u + Ag^^u = T^u (6.4.44) 



Initially Einstein was looking for solutions that would represent a static universe because 
he was unaware of the evidence that the universe had a beginning. No such static uni- 



verse solution exists for (6.4.431 in a universe with matter, so he added the cosmological 
constant termj^ If the cosmological constant term is chosen in a particular way, static 
solutions can exist but they are unstable. With Hubble's discovery of an expanding uni- 
verse and the resulting big bang theory, static solutions became unimportant and Einstein 
rejected the need for A. 

However, despite the initial apparent lack of necessity, A has managed to stick around 
for a variety of reasons. One of the most interesting reasons is if we consider the situation 
with no matter: 

G^,u + A9^,u = Rf^u - ^Rgp^u + Ag^u = O (6.4.45) 
We can then bring the cosmological constant term to the other side, 

G^u = Rf,u - ^Rgf^u = -^g^,u (6.4.46) 

In this case we can interpret the cosmological as an alternative energy momentum tensor, 
T^^ = —Ag^^. So, even in the absence of matter there is a non-zero energy momentum 
tensor, and we therefore interpret A as the energy density of the vacuum. 

This interpretation is actually very convenient. Recall from | |40| that the probability 
for virtual particles to appear in the vacuum is non-zero. This implies that there must 
be something in the vacuum which can cause this, so a non-zero energy density in the 
vacuum is an appropriate concept. Quantum field theoretic considerations and renor- 
malization imply that the value of A is approximately equal to the fourth power of the 
Planck mass rrip 

A ^ (6.4.47) 

where nip ^ 10^^ GeV. However, observations of the universe indicate that this is incor- 
rect, and rather the actual value of A is smaller by a factor of at least 10^^° - this is called 

^^However it has been said that Einstein later called the addition of the cosmological constant the biggest 
mistake of his life. 
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the Cosmological Constant Problem. This is, to date, the largest discrepancy between 
theory and experiment in the entirety of science and is considered to be one of the most 
fundamental and important unanswered questions in physics. 

There is quite a bit more we could say about Einstein's equation, especially about 
how it can be generalized. There are all kinds of terms we could add to it, but with the 
exception of the cosmological constant, they are almost always omitted for a variety of 
reasons, and we therefore won't bother discussing them. 



6.5 The Schwarzschild Solution 



We are now in a position to talk about actual solutions to (6.4.43 1. We will begin with 
a discussion of the simplest case - the absence of matter, or the vacuum (T^^ = 0). 

6.5.1 General Vacuum Solutions 



If there is no matter, we can take T^i^ = 0, and therefore ( 6.4.43[ > is (including the 
cosmological constant term for generality) 



Now contract both sides with the metric, 

g^^-R^, - ^Rg^'g,, + Ag'^'g,, = 
=^ R-2R + 4:A = 
=^ R = AA 

Or if the cosmological constant is zero, 

R = 

But, in the case of A = 0, we also have from Einstein's equation with T^^ = 

RgfjLu '^Rfii/ 

and therefore R = implies 

Rfiu = 

The most obvious solution to ( 6.5.5| > is the Minkowski metric 



/- 


-1 








o\ 







1 
















1 





V 











V 



(6.5.1) 



(6.5.2) 



(6.5.3) 



(6.5.4) 



(6.5.5) 



(6.5.6) 



(recall that we showed that the connection coefficients all vanish for the metric r]^^ in 
equation ( 4.2.120[ >, and therefore the Riemann tensor trivially vanishes, and therefore the 
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Ricci tensor also trivially vanishes). So flat space is indeed a solution to Einstein's equa- 
tions in the absence of matter. 

We call the class of all solutions to ( 6.5.5[ > to Vacuum Solutions to Einstein's equations. 



And while flat Minkowski space is the simplest example, there are other non-trivial ex- 
amples. Consider, for example, the metric near a planet. If the planet has radius R, then 
a point a distance r > R will contain no matter and therefore T^^ = 0, but obviously the 
curvature will be non-flat. So there must be a vacuum solution that is not flat. We now 
find such a solution for the simplest case. 

6.5.2 The Schwarzschild Solution 

We want to find the vacuum solution for Einstein's equations in the vicinity of a planet. 
To make the problem as simple as possible, we will assume that the planet is a perfect 
sphere (which is certainly not an unreasonable assumption) with radius R and total mass 
M. This means that we can assume that the metric is spherically symmetric. 

Furthermore we will assume that the solution is static - this means two things. First, it 
means that the metric doesn't depend explicitly on time: 

dogf^u = (6.5.7) 

Second, it means that the goi components must vanish: 

goi = gio = o (6.5.8) 

(if these components were non-zero then spatial displacement would involve temporal 
displacement, which would contradict the assumption that our solution is static. 

Because we are working with a spherically symmetric metric we will work in spherical 
coordinates. The general form of the metric in spherical coordinates is 

ds^ = -dt^ + dr^ + r'^ide'^ + sin^ (6.5.9) 

In order to preserve the spherical symmetry, the most general ansatz we can choose is 

ds^ = -a{r)dt'^ + b{r)dr^ + c{r)r'^{de'^ + sin^ Odcj?) (6.5.10) 

We can replace r by any function of r without disturbing the spherical symmetry, so we 
have the freedom to set c(r) = 1 without losing any generality. So, our ansatz is 

ds^ = -a{r)df + b{r)dr^ + r\de^ + sin^ ed(f)^) (6.5.11) 

However, it will be convenient for reasons that will be clear later if we make the replace- 
ments a(r) = e^'^^^'^ and b{r) = e^'^^^\ making our ansatz 

ds^ = -e^^de + e^^dr^ + r^dO^ + sin' Odcj)'') (6.5.12) 



We can plug (6.5.121 into the equations for the connection, Riemann tensor, and then 
Ricci tensor (a tedious exercise we leave to you, but encourage you to either work out by 
hand or at least write a computer program to do - it is more instructive to work this out 
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than you might think), results in the Ricci tensor having the following non-zero compo- 



nents, which when combined with (6.5.51 gives the following differential equations (the 
prime represents a derivative with respect to r): 



Rii 

R22 
R33 



2X' 



2v' 



r 



// \ / / I '2 -■ ■ n 
V — Xv + V = 

r 

~2A 



l + rv' - rX')e~^^ -1 = 
22 i 



R22 sin^ ^ = 



The first of ( 6.5.13[ > implies 



2u' 



so adding this to the second of (6.5.13 1 then gives 



u" - u" + X'v' - AV - z/'2 + z/'2 - -(z/' + A') 

r 

z/' + A' = 



Plugging this into the third of (6.5.13 1 gives 

(1 + ru' - rX')e-^^ 



(1 + rv' + rv')e 
(1 + 2ru')e 



-2A 



1 = 
1 = 



-2A 



Next, looking at (6.5.15) we have 



I/' + A' = 

=^ V + X = constant = C 
This means that the coefficient of, say, qqq in the metric can be written 



(6.5.13) 



(6.5.14) 



(6.5.15) 



(6.5.16) 



(6.5.17) 



(6.5.18) 



However, we assume that as r 00 the metric should reduce to the Minkowski metric. 
If we take A — as r — > 00, we get that C must be 0. So, 

jj + X = C = =^ z/ = -A (6.5.19) 



Plugging this into (6.5.161 gives 



1 + 2ru')e^'' = 1 



d 



dr 
re 

„2u 



re^") = 1 

^•^ -r + A 
A 



1 + 



(6.5.20) 
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where A is a constant of integration. Then, using (6.5.191, 

A 



-.-2X 



,2A 



1 + 
1 



(6.5.21) 



So, finally, the metric ( |6.5.12[ ) is 

ds^ = - 



A 

r 



A 



(6.5.22) 



This solution is called the Schwarzschild Metric. The first thing to notice about it is that 
in the limit where r — > oo (as we get infinitely far away from the planet), we have ^ — > 0, 
and therefore (6.5.221 reduces to the Minkowski metric sufficiently far from the planet - 
exactly as we would expect. 

6.5.3 Correspondence of Schwarzschild and Newton 

So what is the meaning of ( 6.5.22[ >, and what is the value of Al To see this we need to 
review some introductory physics first. Recall that Newton's law of gravitation says that 
the gravitational force between a body of mass M and a body of mass m is given by 



G — 



(6.5.23) 



where G is Newton's gravitational constant and f is a unit vector in the direction of the 
vector between the objects. Or, the gravitational field due to a body of mass M is 



(6.5.24) 



where a is the acceleration an object in the field of M will experience. We can also express 
this in terms of the gravitational potential $, 



where 



-G- 



.M 



(6.5.25) 



Now consider a particle in the Newtonian limit - moving very slowly in a gravitational 
field weak enough to be approximated by a perturbation from 1]^,^, and where the metric 
is static (unchanging in time). The assumption that the particles are moving slowly means 
that if we are parameterizing the spacetime coordinates of the particles with r. 



dx"^ dx^ 
dr dr 

We therefore take ^ = 0. Now the geodesic equation will be 

d^xf" 



(6.5.26) 



dr^ 



+ 00 



dr 







(6.5.27) 
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Now, the assumption that the metric is static (and therefore doQ/j^i, = 0) allows us to sim- 
plify the connection coefficient F; 



00- 



00 



^9^^{do9\o + dogox - dxgoo) 
-Ig^'dxgoo 



(6.5.28) 



Next, we use the assumption that the gravitational field is weak enough to be approxi- 
mated by a perturbation from the flat metric - in other words we write 



flU 



(6.5.29) 



where h^i^ is small {\h^^\ << 1). 



Now plugging all of this into (6.5.28 1 we have 

1 



^ 00 



00 



(6.5.30) 



where we neglect the h^"^ term because, again, << 1. 



The geodesic equation (6.5.271 is 

d'^x'' 1 



fdx'^y 



(6.5.31) 



We have assumed that the metric is static and therefore its time derivative vanishes. This 
means that the /i = component of this equation is 



~d^ 



( dx^ 



0\ 2 



which implies 



dx"^ 
dr 



\ dr 



constant = C 







(6.5.32) 



(6.5.33) 



Now consider the spatial part of (6.5.31 1. The spatial part of the Minkowski metric rjfj_^ 

(6.5.34) 



is just the identity matrix, so (6.5.31 1 is 

d'^x^ 1 



'jim 



'i'l'OO 



Ot^ 2 

If we choose our r parameterization so that x° = r, we get C = 1 and this is 

d'^x' 1 ^ , 
-dih 



d{x^f 



'i'tOO 



(6.5.35) 



We can recognize the term on the left hand side of this as the acceleration the particle will 
undergo and therefore we set 



d^x^ 
d{x^f 



(6.5.36) 
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and so 



(6.5.37) 



Comparing this to ( 6.5.25[ >, we see that we should identify 



'-00 



And therefore (finally), by (6.5.291, 



5'oo 



-2$ = 2G- 



Voo - 2$ 
+ 



(6.5.38) 



- 1 - G- 



2M 



(6.5.39) 

where M is the mass of the planet. Obviously comparing this to our result for the 
Schwarzschild metric in equation ( 6.5.22[ > shows that it is identical (we could have done 
similar calculations to get the radial metric coefficient as well). So, we can see that the 
value of the integration constant in (6.5.20) is 

A = -2GM (6.5.40) 

So, the true Schwarzschild metric for the vacuum around a planet of mass M is 



2M\ 



r J 



2M 



-1 



(ir2 + r2(d^2^sin2 Qd(^^) 



(6.5.41) 



where we have gone back to units where G 



6.6 Geodesies of the Schwarzschild Metric 

Now that Einstein's equations have given us the vacuum metric solution for a spheri- 
cally symmetric static massive body, we can get down the more interesting work of seeing 
what this metric implies by plugging the metric into the geodesic equations. We will look 
at two examples of how an object will move under the gravitational influence of such a 
massive body, including motion around black holes and the perihelion of Mercury. 

The fact that "spherically symmetric" and "static" is a reasonable approximation for a 
great deal of what we see in space lends itself to the enormous explanatory power of the 
Schwarzschild metric, and the examples we will look at provide illustrations of some of 
the early experimental successes of Einstein's theory. 
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6.6.1 Black Holes 



There is one quirk that can be immediately seen the form of the Schwarzschild metric 
- namely there is a radius at which there is a singularity. Note that when r = 2M 

fl-oo = 

gn = oo (6.6.1) 

Furthermore, note that for r > 2M the metric has the normal Minkowski (— , +, +, +) sig- 
nature. In that the sign is the relativistic distinction between a "time" dimension and a 
"spatial" dimension, this is a necessary feature. However, on the other side of the singu- 
larity (r < 2M), the time dimension becomes positive and the radial dimension becomes 
negative - in other words the time dimension "becomes spatial" and the radial dimension 
"becomes time-like". 

What would this value be for, say, the earth? Restoring all of the constants to standard 
units, the goo coefficient is 

(6.6.2) 



c^r 

So, the actual "singularity" radius is 

2GM 2(6.67 xl0-i^^)(5.97xl0^^fc^) 

— = (2.99 X lO^^y = " ^ (^•^•3) 

which obviously isn't a substantially large radius, despite the size the earth. We don't 
have to worry about running into this radius accidentally. For the sun, the value is about 
2.98 X lO^m, or a little less than 2 miles. But given that the radius of the sun is more than 
432, 000 miles, we don't need to worry about running into this radius, either. 

But you can imagine a body with much, much greater mass than the Earth, in which 
case the radius would be much greater. And if the mass density of the object is great 
enough, it is possible that this "singularity radius" is greater than the radius of the body. 
Such an object is called a Black Hole, and the radius we have been calling the "singularity 
radius" is called the Event Horizon. 

So what are the properties of black holes, and how do things move near them? We 
can find this by considering the geodesies of the Schwarzschild solution both inside and 
outside the r = 2M (with natural units again) event horizon. 

Let's start outside the event horizon with a particle falling into the black hole radially 
(so that d(f) = d9 = 0). We'll consider an observer watching this happen from a large 
distance away. The observer will parameterize what he sees with the parameter r. To 



find the geodesies we use the geodesic equation (4.2.69). Starting with the time (or zero) 
component, this is 

d'^x^ n dx'^ dx" 

h r° = 

rfr2 ^ dr dr 
. d'^^^ 1 fco 9gv 



rfr2 ^ 2^ \^x^' 
d^x^ , 1 _oo / ^9v0 



dgkt. 


dgf,u^ 


\ dx^ dx" 


dx^ 


dx^ ^ 


1 dr dr 


dgo,. 


dg^^^ 


L dx^ dx"^ 


dx" 


dx^ ) 


' dr dr 



*^+2«laS? + ^-^l^^-° (6.6.4) 
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The third term always vanishes because there is no = t dependence in any component 
of the metric, and the first two terms are only non-zero when ^fQo is involved and when 
the derivative is with respect to = r. So, the geodesic equation is 



00 



dgoo dx^ dx^ dgoo dx^ dx^ 
dx^ dr dr 



dr^ 

dr^ 
d'^x^ 



+ 9 



+ 9 



dx^ dr dr 
00 (9^00 dx^ dx^ _ 

dx^ dr dr 
oodgoo dx^ 







+ 



dr dr 
1 dgoodx^ 



dr'^ goo dr dr 
d^x^ dgoo dx^ 



900 
d 

dr 
9oot 



dr^ 



dr dr 









5'oo 



dx° 
dr 



(6.6.5) 



where k is some integration constant and the dot represents a derivative with respect to 
T. Or writing this out. 



- 1 



2M\ . 



t = k 



(6.6.6) 



We can then assume that the parameter r is such that the norm of the four-velocity 



dr 



has unit length: 



9oo{v'>f + gniv'f = : 

i9ooV°y + 9oo9iiiv^)'^ 
u2 r.M2_. 2M 



5'oo 



(6.6.7) 



where we have recognized i = and used the exact form of the metric to see that googn 
— 1 (cf (6.5.41 1). Rearranging this. 



V 



dx^ 



P - 1 + 



2M 



(6.6.8) 



Now, the quantity that is of interest to us is the "r velocity", or the derivative of the 
r = x^ component with respect to the t = x^ component, % = But, we can write this 
as 



dr 
di 



(dr/dr) 
(dt/dr) 



r 
i 



V 



(6.6.9) 



Plugging in (6.6.61 and (6.6.81 this is 



A;2 - 1 + 



2M 



dr 
di 



(6.6.10) 
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Rearranging this. 



dt 
dr 



-k 1 - 



2M 



2M\ 
r J 



-1/2 



(6.6.11) 



Consider the behavior the observer will see when the particle gets close to the event hori- 
zon, so that r = 2M + e (it should already be clear that the first term in big parentheses 



above is singular in the limit e — > 0). Expanding (6.6.11 1 to first order in e gives (sparing 
the tedious Taylor expansions). 



dt 
dr 



k 1 - 



2M 
2M + e 



1 + 



2M 

2M + e 



-1/2 



2M 



2M 



r-2M 



(6.6.12) 



Integrating this. 



dt = -2M / dr 



r-2M 

t = -2Mln(r - 2M) + Const. 



Graphing this gives (for r > 2M) 



(6.6.13) 



r=2M 
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So, as r ^ 2M, we have t — > oo. In other words, to the observer watching the particle fall 
into the black hole, he sees it take an infinite amount of time for the particle to reach the 
event horizon. He will watch the object fall forever. 

But certainly this must be incorrect! What could cause the object to slow down as it gets 
closer and closer to something that becomes more and more attractive? Intuition (and 
agreement with Newton) would demand that it move faster and faster (relative to the 
black hole, and therefore relative to an observer stationary relative to the black hole) as it 
gets closer and closer to the event horizon. 

The solution to this apparent paradox is in the fact that we are speaking relative to 
what the observer sees. Keep in mind that what the observer sees is the light reflecting 
from the object back to him. However, as the object gets closer and closer to the event 
horizon, the gravitational pull (i.e. spacetime curvature) gets stronger and stronger, and 
therefore the light rays have a harder and harder time getting back to the observer. As 
the observer gets closer to r = 2M, the light is less and less able to get from the falling 
particle to the observer, and therefore the particle appears to take an infinite amount of 
time to fall in. 

But what about an observer falling into the black hole? What will he seelj^ We can 
calculate this by simply considering the geodesic equations not with an arbitrary param- 
eter r, but with the proper time observed by the falling particle. In this case x° = t = r, 
and therefore we have (from ( |6.6.9| >), 



dr (dr/dr) (dr/dr) r . i . ^ ,^ 

— = ^ — - — '- = - — ' — - = - = r = v^ (6.6.14) 
dt (dt/dr) (dr/dT) 1 ^ ^ 



So, 



or 



;^ = -^/F-l + ^ = ^ (6.6.15) 

r dt 



dt /.o 2M^~^/^ 



dr 



r - 1 + (6.6.16) 



If we once again take r = 2M + e, notice that we won't have the singular behavior like 
we did above. Rather than actually using the e approximation we used above, let's try to 
integrate this directly: 

dt = — I dr 



dr, 



r{k^ - 1) + 2M 



(6.6.17) 

^"Assuming he is concerned with mathematical physics while preparing to be crushed to death by a black 
hole. 
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where A and B are defined in the obvious way indicated. The exact form of this integral 
is fairly ugly However, notice that the integral becomes very, very simple if we can take 
A = 0. While this may initially seem like a bit of a cop out, let's consider the physical 
meaning of such an imposition. 

If we are to take A = k"^ — 1 = 0, this is the same as setting k = ±1. Looking back at 
the definition of k in equation ( 6.6.5[ >, it is defined by 



goot = -[l-—y = k (6.6.18) 



r 



where the dot represents a derivative with respect to the parameter r. If we assume that 
the particle starts from rest a long way from the black hole (so that r is large), the value k 
becomes the initial value of the 0, component of the metric. And if we assume that the 
particle starts a long way from the black hole, we would expect it to be the value of the 
flat Minkowski metric - it should be 1. So, not only is A; = 1 a reasonable constraint - it is 
in fact the most reasonable value for k to take. 



With that imposed, our integral (6.6.171 is then 



- I dr*' 



rA + B 



2VB 



+ Canst. (6.6.19) 



When graphed, this gives 
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t 



r=2M 



So, an observer falling with the particle will go right past the event horizon to be promptly 
crushed inside the black hole. 

There is, as you would imagine, volumes more we could say about black holes. But 
because this is merely meant to be an introduction to the basic ideas of general relativity 
(which we are including largely as an illustration of the geometry we considered in the 
first part of these notes), we won't continue our discussion. 

6.6.2 Perihelion of Mercury 

Next we consider another illustration of the applications of general relativity. But in 
order to appreciate this particular application, we first review (what should be) a familiar 
calculation from classical mechanics. The following several pages will be a return to en- 
tirely Newtonian theory - forget about general relativity until we bring it up again. The 
purpose of this is to (later) compare the Newtonian result to its relativistic generalization. 

We could derive the following (Newtonian) result by using Newton's laws directly, 
but instead we will take the Lagrangian approach. The end result is the same. Newton 
proposed the "one over r squared" law for gravitation, where the force on an object of 
mass m by an object of mass M separated by a distance r = rf (where f is the unit vector 
in the direction from M to m in a coordinate system where the center of M is taken to be 
the origin - note however that r should be thought of as a position vector, not merely the 
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radial component of a position vector) is 



F = -^f (6.6.20) 

(where we are still taking G = 1 and the minus sign is because the force on m is towards 
the origin, which is the opposite direction as f). If we take an arbitrary position vector in 
this coordinate system to be r, then Newton's Law for this system is 

—Y — mr (6.6.21) 

Now consider the angular momentum L for this system. This will be generally defined 

by 

L = r X mf (6.6.22) 

where the x means the usual vector cross product in three spatial dimensions. Consider 
the time derivative of L: 

^ — ^(r X mr) —rx mr + r x mr (6.6.23) 

The second term vanishes because any vector crossed with itself is zero. Then, plugging 
in Newton's law for the first term, 

/ Mm \ ,^ , 

rxmr = rxl 1 = ^ (6.6.24) 

because r and f are in the same direction and therefore their cross product vanishes as 
well. So, we have 

so the angular momentum is conserved. This means that, while m may rotate around M, 
it will not leave the plane it starts in. We therefore proceed with the assumption that m 
will move in a plane. 

We will write out the Lagrangian for m in polar coordinates {R, 0) to derive its equa- 
tions of motion. We can write tine potential term for Newton's gravitational force law 
as 

V^-"^ (6.6.26) 



So that 



^ . d ( Mm\ Mm^ . r.r.-. 
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Then the kinetic term will be 



T 



-m 



m{R' + R'(f)') 



So, the full Lagrangian is 



L = T - V = lm{R' + R^') + ^ 
2 K 



(6.6.28) 



(6.6.29) 



We can write out the equations of motion for this, starting with the (f) equations: 

dL 



d dL 
dtd^ 

Integrating the second equation gives 







d 
di 



{mR^ 







R' 



(6.6.30) 



(6.6.31) 



where / is some integration constant. Equation ( 6.6.31[ > is a statement of the conservation 
of angular momentum. 

The R equations of motion then come from 



d dL dL 
dtdR ~ OR 



dL_ 

OR 
d^dL^ 

dt dk 





mRcj)'^ 



Mm 



mR 



R-R^' + § = 



(6.6.32) 



So our two equations of motion are (6.6.31 ) and ( 6.6.32[ >. We can plug the first into the 
second: 



(6.6.33) 



Now, introduce the variable 



u = 



R 



Now, 



• d ^ d (I 



u 



(6.6.34) 



(6.6.35) 
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However, we can re-express this as 



1 du 

V? dt 
1 du d4> 

(90 dt 
1 -^du 

d(j) 
1 / du 

E? dcf) 



:lu' 



,du 



u^ 



-I 



du 



where we used ( 6.6.31| > and (6.6.341. Then, 



R 



d_ 
dt 



u 
u^ 



u^il — 2uu^ 



u^ 



u 



u^ u^ 



2 (du\_ 1 d du 
u^\di) ~^didt 

2 / o_ X 2 



u 



d 



^ d(j)) u^^^ d(j) 



d(t)J 
du du 



2 fdu'^'^ 



d'^u 



Now, plugging (6.6.37) and (6.6.34) into (6.6.33), we have 



dcp^ 



or 



d'^u M 

a0^ + "-7^ = ° 
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Equation (6.6.39) is called the Binet equation, and it is a differential equation for u = 
whose solutions give the equation for the orbital motion R = R{(t>). The solutions ar^^ 

1 M 



R 



U{<j)) 



+ Acos(0 - B) 



R{(j)) P 

where A and B are constants of integration. Rewriting this for R, 



R{cl>) 



M 



M + AP cos(0 -B) 1 + ^ 



cos 



Or, defining L = and E 



Mi 
M 



(6.6.40) 



(6.6.41) 



l + Ecos(0-5) 



(6.6.42) 



You should recognize this as the equation for an ellipse, so long asO<-E'<l. ^£" = 
this is the equation for a circle: 



(6.6.43) 



If E is between and 1, however, the graph is an ellipse along the axis defined by the 
angle B. 




It is relatively straightforward to solve this equation so we leave the details to you. 
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The closer E gets to 1 the "longer" the ellipse gets. The angle B specifies the angle where 
the graph comes the closest to the origin - or the point where the orbiting object m comes 
the closest to M. This location, the point of closest approach, is called the perihelion of 
the orbit. 

So, this shows us that according to Newton's gravitational law, objects orbit planets in 
ellipses. 

For E greater than or equal to 1, the graph is no longer an ellipse but rather is a 
parabola: 




This is the case where m comes close enough to M for its trajectory to be affected, but not 
close enough for orbit or to fall into M. 

In either case, both A and B are constants, the mass of the planet M is assumed to 
be constant, and the value / is constant (because angular momentum is conserved). So, 
according to Newtonian gravitation, m will move through the same elliptical orbit forever 
- it will never leave the plane it starts in and the perihelion will never change. 

But this lead to an observational discrepancy. It was observed that the perihelion of the 
planet Mercury was not constant - its B value was precessing, or rotating around the sun. 
This type of affect could be loosely explained using Newtonian gravitation once the other 
planets were taken into affect - it is possible for the remaining mass in the solar system 
to cause the perihelion of one planet to precess. However, the rate of the precession was 
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nowhere near what Newtonian theory could justify. A number of ad hoc approaches were 
considered, but none were successful. 

However, Einstein showed that general relativity predicts exactly the observed pre- 
cession of Mercury's perihelion. We will now reproduce this calculation. The gravity of 
the sun, which is a spherical body, can be modeled by the Schwarzschild Metric 



ds' = - 1 



2M 



2M 



-1 



(6.6.44) 



In (6.6.5) we showed that one of the equations of motion for a particle moving near a 



Schwarzschild body is 



dr 



1 )t 

r I 







(6.6.45) 



We can use the geodesic equation to find the remaining equations. It turns out that while 
the yu = 1 equation is difficult to compute, it is not necessary. Therefore, we'll start with 
the /X = 2 equation: 



d^x'' 



dx^ dx" 



dr dr 



k2 
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vk 



^ + 2^ 
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d^x^ 
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dx^ dx" 
dr dr 
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dr 



r sin cos I 







Then, finally, the /i = 3 equation: 



(6.6.46) 



d'^x^ dxf^ dx'' 
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d'^x^ 

-d^ + 2' 



k3 



dg 



vk 
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dgk,i 



dgny \ dx^^ dx'" 



^x^^ dx 



dx^ j dr dr 



d 

dr 



'2-2 

r sm 







(6.6.47) 



^^Of course, because the radius of the sun is much, much larger than the radius of its event horizon, we 
won't have to worry about Mercury's orbit approaching it. 
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So, our equations of motion from the metric are 



dr 



d \ 2M 
dr 



[r^9) -r^ sine cos 6 (p'^ = 



dr 



[r sm 



(6.6.48) 



We found that angular momentum was conserved in Newtonian motion, allowing us 
to restrict motion to a plane. Let's see if this is a solution here - consider letting 6 be fixed 
at^ = |. So,^ = oQand our second equation of motion becomes 



dr 



ye) = r'^e + 2rfe = r'^e = o 



(6.6.49) 



We can take further derivatives of this equation to show that indeed all higher derivatives 
of 9 are in fact zero. So, planar motion is in fact possible, and we can take = ^ exactly in 
all that follows. 

This will make the third of our equations of motion 



dr 



(r20) = 



r^(f) = l 



(6.6.50) 



where / is the constant of integration. This is again the conservation of angular momen- 
tum. 

We can also integrate the first of the equations of motion to get 



2M\ 



i = k 



(6.6.51) 



where A; is a constant of integration. 

Now let's consider the motion of Mercury using the proper time, so r = s. This value 



for r will make the 
because we have fixed 9) 



ds 
dr 



1, and therefore we can rewrite the metric as (leaving d9 = 



ds' 



2M 



df 



2M 



dr^ + r^{d9^ + 3111^ 9d(t?) 



2M 



2M 



•2 I 2 12 



(6.6.52) 



Plugging in ( 6.6.51[ > this is 



2M 



-1 



A;^+ 1 



2M 



•2 I 2j.2 

r + r d) 



(6.6.53) 



^^However, = does not necessarily mean that all higher derivatives of are zero as well - we will have 
to determine whether or not this is the case. 
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Now we make the same u = - substitution we did in the previous section (cf (6.6.341). 
Then, using the definition of u along with (6.6.36) and (6.6.371, we can rearrange (6.6.531 
as 



/ du 



. P - 1 2M 
+ = — z — + 



(6.6.54) 



This can't be easily solved, but if we take the derivative of this with respect to we get 



M 



(6.6.55) 



Comparing this with (6.6.391 we can see that the left hand side is the exact same, while 
the right and side has the 3Mm^ term. 

Equation ( 6.6.55[ > is the relativistic version of Binet's equation. Consider the ratio of 
the additional 3Mm term to the constant expression from the original Binet equation: 



— S7/ / — 

M/P r2 



(6.6.56) 



For Mercury this ratio is on the order of 10^ - thus the additional term is very small. 

Because the general relativistic correction is very small we can solve the relativistic 
Binet equation perturb atively. Again sparing the details of solving itj^ we merely give 
the solution: 

R{<t^) = 7=. 7T7 ^ ;^ (6.6.57) 

1 + E cos{(j){l - e) - B) ^ ' 

where e is the small value e = Graphing this gives 




^^Finding the solution is straightforward using standard approximation techniques. 
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As you can see, the motion is still elliptical. However, the (1— e) term has the ejffect of shift- 
ing the perihelion by a small amount on each revolution. The value of this precessional 
shift predicted by general relativity lined up almost exactly with what was observed by 
Mercury's motion. 

6.6.3 Concluding Thoughts on Schwarzschild Geodesies 

As we have illustrated, the true physics of gravity is tied up in the geodesic equa- 
tions. We start with the metric, which defines the geometry of the spacetime, and then 
use the metric to calculate the Levi-Civita connection, which in turn gives us the differen- 
tial equations for the geodesies. These differential equations take the place of Newton's 
gravitational law and Newton's second law (which are also differential equations). We 
have seen that in the limit where gravity is weak, the geodesic differential equations of 
general relativity reduce to Newton's laws. However, for strong gravitational fields there 
are relativistic corrections to the Newtonian equations. 

So, whereas in Newtonian physics, Newton's laws provide the differential equations 
of motion whose solutions define the behavior of a physical system, in general relativ- 
ity the Levi-Civita connection (as defined by metric) takes over to define the geodesies. 
And whereas in Newton's laws the form of the differential equation came about from 
observation (i.e. F — G^), general relativity provides a way of calculating it a priori - 
using Einstein's field equations G^i, = kT^^^. While Einstein's field equations can rarely be 
solved directly, the use of an ansatz (an educated guess) can make solving them possible, 
resulting in the metric in terms of the energy distribution in a region of spacetime. And 
once we have the metric, we are able to calculate anything else we want. 

6.7 Cosmology 

d 

6.7.1 Energy Momentum Tensor for the Universe 
d 

6.7.2 General History of the Universe Through Present 
d 

6.7.3 Observable Matter and Dark Matter 
d 

6.7.4 Cosmological Constant and Dark Energy 
d 
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6.7.5 The Future of the Universe 



d 



6.8 General Relativity as a Gauge Theory 

Before concluding this chapter we would like to tie in everything we have done with 
general relativity into our overall agenda with this series of papers. One reason for this 
is that our larger goal is to understand gauge theories. The idea of a gauge theor}|^ is 
that given some Lagrangian £ with some global symmetry, gauging the transformation 
destroys the symmetry. Restoring the symmetry requires the introduction of gauge fields 
Afj, which don't transform tensorially. Through minimal coupling we introduce the gauge 
fields into our Lagrangian by replacing partial derivatives with covariant derivatives con- 
taining the gauge fields]^ Then, so that the gauge fields don't become merely background 
fields that decouple from all interactions we introduce a field strength term F^„. 

A second reason to look at gravity as a gauge theory is that eventually we will want 
to talk about quantum theories of gravity. And, as discussed in [40J, writing out a La- 
grangian for a theory is the first step in quantizing it. So, having a Lagrangian for general 
relativity will be a necessary tool when we get to theories of quantum gravity. 

6.8.1 Relationship Between Gauge Theory and Geometry - A First Look 

First of all, we're calling this section merely "A First Look" because the true relation- 
ship between gauge theory and geometry is one of the most profoundly deep and rich 
fields in all of mathematical physics. In fact, this entire paper is merely a primer for one 
to begin learning how gauge theory and geometry converge. We will have more to say 
about this in the last chapter of this paper, and the next paper in this series will be a 
much, much more detailed treatment of geometrical gauge theory. 

Before moving on with gravity as a gauge theory, we'll spend some time considering 
how gauge theories are generally set up for non-gravitational forces (the standard model 
forces). This will guide our intuition for gravitation as a gauge theory. We'll do this 
by first reviewing the geometry of this section, and then comparing this to how gauge 
theories were set up in [|40l . 



Review of Differential Geometry 

In chapter [Ij after introducing metrics, we discussed the notion of parallel transport]^ In 
order to define the derivative of a tensor field we must have some notion of what "paral- 
lel" means. In other words, we must be able to compare tensors in two different tangent 
spaces in order to define a derivative in a meaningful way. This problem was manifested 

^^Review f40\ if you're not familiar with this. 

^^All of this should have a radically deeper meaning to you after having read the contents of this paper 

so far. 

^^You are encouraged to go back and re-read section 4.2 at this point. 
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by the fact that partial derivatives don't transform in a tensorial way (cf equation (4.2.2 1): 



dx'' dx'^ ( dv 



dx'j dx^ V dx 



(9x^ d'^x'' 
dx'i dx^dx^ 



+ 



(6.8.1) 



Notice that the first term is in fact tensorial while the presence of the second non-linear 
term makes this transformation non-linear/non-tensorial. 

We can rewrite this in terms of the transformation we are actually invoking: 



dx^ 
dx'^ 



(6.8.2) 



where the inverse is 



h 



dx''' 
dx^ 



(6.8.3) 



In terms of this transformation, (6.8.1 1 is 

dv'- 



[dx^J 



dx^ 



(6.8.4) 



In order to correct this nonlinear term, we introduced a connection, F*^ which we de- 
fined to have a certain transformation law: 



dx'^ dx dx^ 



d X dx'^ 



dx^ dx^ dx"- dx'dx^ dx" 



(6.8.5) 



Again, the first term in the transformation law is tensorial. The second, however, is a 
non-linear term which makes the connection non-tensorial. We can of course re-write this 



as: 



(6.8.6) 



Comparing (6.8.41 and (6.8.61 we see the non-linear terms are identical (except for the 
V in ^K^ ). 

Next, in order to make a derivative that does transform in a tensorial way (and there- 
fore has a well defined way of comparing tensors in different tangent spaces), we replaced 
the partial derivative with the covariant derivative (cf equation ( 4.2.37[ > ff) 



(9f * 
dx^ 



, T^i k 

dx^ 



(6.8.7) 



Then, using the transformation laws (6.8.41 and (6.8.61, it is straightforward (though te- 
dious, cf equations ( 4.2.53[ > and (4.2.541) to show that 



dx^ 



dv'' 
dx°- 



dx"- dx'^ ( dv 



dx'^ dx'' \dx 



+ T'v'' 



(6.8.8) 
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which has exactly the tensorial form we would expect. So, by introducing the connection 
r*;, we have preserved the tensorial nature of the derivative, and in doing so allowed a 
derivative that compares tensors in different tangent spaces in a meaningful way 

So, to summarize, the problem was that partial derivatives don't transform tensorially 
in a geometrically non-trivial space: 

dv^ dx'^ dx'^ dv'- ^ dx^ d'^x''^ i (6 8 9) 

dx^ dx'^ dx'- dx^ dx'i dx^dx^ 

To fix this we define the connection V^-^ which also transforms in a non-tensorial way: 

q dx'P dx'^ dx^ I d'^x''- dx'^ 

' dxi dx' dx'^ ^ dx'dxi dx'^ (•• ) 

And then, forming the covariant derivative 

D,v^ = — + T],v' (6.8.11) 
we find that Djv' transforms in a tensorial way. 



Review of Gauge Theory 

We now revisit what we did what gauge theories in [40]. You no doubt suspect that there 
is a strong correlation between the geometry of this paper and the physics of the previous 
- if for no other reason than the fact that we have things called "covariant derivatives" in 
both places. 

In a gauge theory we begin with a Lagrangian C (or =Sf ) that possesses some global 
symmetry, (p Ucp where U is some transformation matrix. The transformation may be 
a spacetime transformation, like a translation, rotation or a Lorentz boost, or it may be 
a gauge transformation, like U{1), SU{2), SU{3), etc. In either case, the field is free to 
"move" through some space, whether spacetime or gauge space. The only difference is 
that spacetime is considered an external degree of freedom where the field can move, and 
the gauge spaces are considered internal degrees of freedom where the field can move. 
Keep in mind that gauge groups are Lie groups, which are groups that correspond to a 
manifold - a geometrical space. So, on one hand the field can move around spacetime, 
but on the other hand it can "move" around the gauge space in the exact same way. 

It is in this sense that we must realize that gauge theory demands the introduction (or, 
the assumption) of some very elaborate geometries. For example, consider a field whose 
Lagrangian is symmetric under a f/(l) transformation, like the Lagrangian for a Dirac 
spinor (i.e. an electron). The field is certainly able to move around spacetime (translations, 
rotations, boosts), but it can also change its phase. And because the gauge/Lie group 
f/(l) is topologically/ geometrically a unit circle in the complex plane, we can envision 
the true space the physical field is allowed to move in as not merely the familiar 1 + 3 
dimensional Minkowski spacetime Ai'^, but actually the more elaborate space Ai^ ® S^. 

So, the degrees of freedom of the field are spacetime translations, spacetime rotations, 
spacetime Lorentz boosts, and U{1) rotations. And in demanding the theory to be rel- 
ativistically invariant, we are demanding that the Lagrangian be invariant under global 
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translations, rotations, and Lorentz boosts. Consider the case of making these local - this 
means that the translation, rotation, or boost may change from point to point in space- 
time. It should therefore be plain that the difference between a global translations, rota- 
tions, and boosts and a local translations, rotations, and boosts is exactly the difference 
between inertial frames and non-inertial frames! If the way the field translates, rotates, 
or boosts depends on the spacetime location, that means that it may be changing speed, 
direction, etc. So, gauging the spacetime symmetries of the field is identically what we 
need to make the jump from special relativity to general relativity (more on this later). 

And the same is true with making a gauge symmetry local. The field can be trans- 
formed through its U{1) space globally, which means that the field is altered the same at 
every point in space and time. By gauging the U{1), we are essentially allowing the field 
to be "non-inertial" in the part of the space it is free to move in. This is precisely the 
geometrical meaning of gauging a symmetry. 

And, whereas making the spacetime degrees of freedom local allowed for non-inertial 



frames which made the spacetime geometry non-flat (cf section 6.4.1 1, thus demanding 



the introduction of a connection (cf the previous section as summarized in equations 



(6.8.9)-(6.8.11 1) and curved geodesies (cf equation (6.4.21) at the end of section (6.4.11), 
gauging the gauge symmetry allows for "non-inertial" frames in the Lie space, making 
the gauge geometry "non-flat", thus demanding the introduction of a gauge field (which 
is identical to a connection). And because the gauge fields play the role of force carrying 
particles, they do indeed alter the geodesies of the field. 

So, with gauge theory, we started with a Lagrangian C that was invariant under a 
global Lie group (G) symmetry. This is essentially demanding that all "inertial" frames in 
^A'^ ® G are equivalent. By gauging the symmetry we are allowing "non-inertial frames" 



in the internal degrees of freedom G. This, as expected (by analogy with (6.8.91), results 
in a non-linear transformation of the Lagrangian, which demands that we introduce a 
connection. Mathematically, it is identical to the connection we discussed in chapter |4| 
physically it is a gauge field. 

To illustrate this with a simple example, consider the process of making the U{1) local 
as in BOl . We begin with the Dirac Lagrangian 

C = iPiiYdf, - m)i) (6.8.12) 

which has a global U{1) symmetry: 

i) — > e*"V and — ^ ^e"^" (6.8.13) 

so under this transformation, 

£ = ilj{i-f^'df, - m)ilj — > i^ii^d^ - m)ip (6.8.14) 

Making the symmetry local had the result of adding a non-linear term when the transfor- 
mation is carried out: 

C = ip{i'-)^dfj_ - m)4) — > 4){i'j'^dfj_ - m - ■j'^dfj,a{x))ip (6.8.15) 
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To correct this, we introduced a gauge field which we defined to transform under 

U {x) = e^°'^^'> according to 

— > U{x)A^U-\x) + iU{x)d^U-\x) = Ay.- a^a(x) (6.8.16) 

We introduce the gauge field through the covariant derivative: 

Dy = dy^ lAy (6.8.17) 

and by replacing the partial derivatives in the Lagrangian with covariant derivatives, we 
restore the symmetry: 

C = tpii^Df, - m)i) — ^ i!{il^Dy - m)V^ (6.8.18) 

So, as outlined above, the role of the connection in non-flat geometries is absolutely 
and in all ways identical to the role of the gauge field in particle physics. Comparing the 
relevant equations, with geometry we had partial derivatives transform non-tensorially 

£ + ^'^/^.(t/-^)? (6.8.19) 



whereas with gauge theory we had a non-tensorial term in our Lagrangian (6.8.15): 

i'ih'^dy - m)4) — > 4){i'y'^df, - m - d ya{x))ii (6.8.20) 

Notice that the non-tensorial term in both these expressions is essentially the same (other 
than some constants)]^ 

Then, with geometry we introduce a connection that also transforms non-tensorially 
( |6X6l ): 

{U-'r,{U-'tUfTi^ + Ufd,{U-')] (6.8.21) 

whereas with gauge theory we had the non-tensorial gauge field which transforms ac- 
cording to (6.8.16) 

Ay — > U{x)AyU-'^{x) + iU{x)dyU-^{x) (6.8.22) 

Again, notice that the non-tensorial term in both cases is the same (other than some con- 
stants) - UdU'\ 

Finally, with geometry we build the covariant derivative as 

D,v^ = — + T],v' (6.8.23) 



^^This is more clear if we note that in the first equation, U (like all transformations) can be written as the 
exponentiation of some generators T, U{x) — e*"(^)^, so UdU^^ will be 

UdU-^ = g»a(x)T^g-»a(x)T ^ e'"(^)'^e-*"(^)^(-i9a(x)T) = -iTda{x) 

where T ~ \ for the U{\) example we are working with here, because the generator for the Abelian U{\) is 
simply the identity. 
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whereas with gauge theory we had 

D^<P = d^cp + iA^(t> (6.8.24) 

where and F*^ play exactly the same roles. 

All of this was to provide the beginnings of a link between differential geometry and 
gauge theory. The purpose of spending several pages on this, however, was not really to 
enlighten you to the geometrical aspects of gauge theory (although we hope we have at 
least begun to do that), but rather to provide some intuition on how we may proceed with 
making general relativity a gauge theory. As we said above, we will spend considerably 
more time talking about the geometry of gauge theory. 

But we have gone as far as we need to go down this road (for now), and we return to 
general relativity as a gauge theory. 

6.8.2 Gravity Coupled to Scalar Fields 

We're going to begin this topic by coupling gravity to scalar fields. While not partic- 
ularly interesting physically, this will provide a nice introduction to the topic of gravity 
as a gauge theory, and it will be a nice stepping stone for when we begin to study gauge 
theories from a geometrical context later. 

We'll begin with a real scalar field with Lagrangian 

£ = -]^d^ct>d^ct> - ^mV' (6.8.25) 



As we mentioned in section ( 6.3.4[ >, the idea underlying general relativity is that the space- 



time metric becomes a dynamic field along with (p. This means that we don't have our 
spacetime geometry build in a ■priori. We therefore define the action not as the integral 
merely over dtdxdydz, but over the invariant volume element dtdxdydz \/Jg\- We do this 
by modifying our Lagrangian from £ to ^ where 

if = £ = - ^ ^/\g\{dyd^(p + m202) (6.8.26) 

So, our action is 

S = dtdxdydz^ = / dtdxdydz ^/\g\C (6.8.27) 



We will, as in section (6.3.41, work with =Sf as our Lagrangian. 

We indicated above that the role of the gauge group in general relativity is the col- 
lection spacetime translations, rotations, and Lorentz Boosts. Obviously our scalar La- 
grangian here is indeed invariant under such (global) transformations. Following what 



we usually do for gauge theories, we would gauge the symmetry in (6.8.261, introduce a 
gauge field through a covariant derivative, and then include a kinetic term for the gauge 
field. But, it is in trying this that we run up against a small barrier - the scalar field cj) 
transforms trivially under translations, rotations, and boosts - it has no spacetime indices. 
So, the covariant derivative is (boringly) equal to the partial derivative: 

= (6.8.28) 
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However, as we have said, general relativity consists of making the spacetime met- 
ric a dynamical field. And, because we are working with ^ instead of L, our matter is 
certainly coupled to this new dynamical field ((7/^1^)1^ So while we're off to an uninterest- 
ing start, we can still proceed because it is clear that our action has no term containing a 
derivative of g^j^y, and therefore we have the same problem we had with the gauge fields 

- it has no dynamics. The equations of motion prevent the metric from being a dynam- 
ical field, and because the whole point of general relativity is that the metric is a dynamic 
field, we should follow what we did in |40|| and add a kinetic term for the metric. 

But we must be careful. Keep in mind that there is a fundamental difference between 
the fields of the standard model and the field g^^ - namely that is a field existing in 
spacetime, whereas g^^ is spacetime itself. This is our first indication that mirroring what 
we did for the standard model forces may not be our best bet. 

What we need is a good guess for the kinetic term for g^^. First of all, it must contain 
derivatives of the metric in order to be a kinetic term. Second, because we are forming 
a Lagrangian, it must be a scalar. So, what we need is a scalar that contains geometric 
information and also consists of derivatives of the metric. 

Having gone through the contents of this paper, coming up with a guess should be 
easy! The most obvious guess for a scalar that has derivatives of g^^ and contains geo- 
metric information is the Ricci scalar, R = g^'^R^u, a quantity naturally provided by differ- 
ential geometry. So, our guess for the term we add to the Lagrangian to give the metric 
dynamics is 

= ^\R (6.8.29) 
where the H in the subscript stands for Hilbert, who was the first to propose this action. 



Now, if we denote (6.8.26) the "matter Lagrangian", our total action is 



S = I dtdxdydz{^M + i^-^h) 

dtdxdydz^\( - ^9^09^0 - ]^m^<p^ + kR ) (6.8.30) 



where k is some constant of proportionality. 



From sections 6.3.4 and 6.3.5 (specifically equation (6.3.811) we know that the varia- 
tion of the matter part of this action with respect to the metric will result in the energy 
momentum tensor. So what will the variation of R with respect to g^y give? 

We ultimately want to find 5Sh, where 

S„ ^ j dU.dydz^„ ^ j d'.^„ (6.8.31) 

and 

SH[g^lu] — > Snlgf^u + Sg^,u] = Snlg^iu] + SSnlg^Lu] (6.8.32) 



29, 



Of course, the inner product in d^^(t>d^ = g ^^d^ (pd'^ (j) also couples to the metric. 
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so that 



(6.8.33) 



So, writing this out, 

5Sh 



\9\R) 



glR.Jg'"' 



glg'^'SR 



(6.8.34) 



We'll start with the last term, Recall that the Ricci tensor R^i, is a contraction of the 
Riemann tensor, so 



5R 



(6.8.35) 



Then, using the definition of the Riemann tensor (4.4.6 1, this is 

^Rp = SO V — d V A-V T"' — f V ) 

= d,{6np) - ^.(^r^;.) + T^pA^Kp) + {srpKp - KAS^^pp) - i^K^Kp 

(6.8.36) 

The obvious step at this point would be to write out the connection coefficients in terms 
of the metric gi_i_i, to get the variation. This approach works just fine, but instead we will 
take a simpler (and more clever) approach. 

Consider the variation of the connection F^^,. By definition this comes from 



And so 



S'^pulgap] = TP [g^p + 5g^ 



Jaf3\ 



^pulga^ 



(6.8.37) 



(6.8.38) 



Now, while it is true that F^^ is not a tensor, we pointed out in section 4.2.6 that the 
difference between two connections is a tensor. We can therefore take (5F^^ to be a type 
(1, 2) tensor. And because it is a tensor, we can take a covariant derivative. Using the 
generalized form of the covariant derivative, equation (4.2.421. This is 

V,{6Tp = d,{6Tp + F^^(5F^J - T;^{6Tp,) - F;,(5F^J 

Now consider the difference in two such terms: 



(6.8.39) 



V,(5F^J 



V^(5F^J 



-t;^{6tp,) + F^,(5F^j - f;,(5f^j + f;:,(5f^j 



(6.8.40) 
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Comparing this to (6.8.361 we see that they are the same! So, 



6R 



(6.8.41) 



So, the last term in (6.8.341 is now 



J d^x^/Wlg'^SR,. = /rf'^v^^?^"(v,(5r^J-v.(5r^^)) (6.8.42) 

Then, using the fact that the covariant derivative of the metric vanishes by definition (cf 



equation (4.2.108 », this is 



(6.8.43) 



This is an integral over a total (covariant) derivative and therefore is equal to a boundary 
term which we can take to be zero (as we always do in physics). 



So, the expression we are trying to find, (6.8.34), is now 

6Sh = J d'^xig'^'R^Jy^l + y^lR^Jg 



We now work with the first term. Evaluating this requires the use of (6.3.79 1: 



dg. 



\9\9 



al3 



from which it is clear that 



•^V 1^1 = 2V 1^1^' 



'^5g^f 



(6.8.44) 



(6.8.45) 



(6.8.46) 



So, we can now write (|6.8.44|) as 

5S 



H = J d'x(^g^''R,,^y^\g''^6g^p + ^\R,Jg''^^ (6.8.47) 
Notice that the metric variation part of the first term has lowered indices whereas the 



metric variation in the second term has raised indices. We correct this using (4.1.14): 

=^ S{9''9,u) = 

=^ 9^''Sg,, + g^u6g>^'' = 

=^ 9^''5g,. = -9,^69'" 



g^'^g/iu = n 



(6.8.48) 



and therefore, finally, 

5Sh 



d^x(^g^''Rj-^\g^ng^p + ^\R^,5g>^- 



d X 



R\/\9\9cp5g'^^ + \/\g\Rf,y5g 



d''x^\g\5g^'''\R^,--Rg^ 



d''x^\g\5g^'^G^ 



(6.8.49) 
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where G^y is the Einstein tensor (cf equation (4.4.71 )). So, amazingly, the the equation of 
motion for the Hilbert action without a matter Lagrangian is (for arbitrary 5g^y) is 



Gfj,u — 



(6.8.50) 



which is Einstein's equation in the absence of matter, as expected (cf equation ( 6.4.43[ >). 
Or, if we use the full action (16. 8. 301), 



SS = 6 d^x{^M + i^^h] 



dt^xiT^y + kG 



(6.8.51) 



So, if we use (6.4.41 1 to get the appropriate value for k, we have Einstein's field equation 
exactly (cf equation ( 6.4.43[ >): 



G^u — SnT^y 



(6.8.52) 



Thus, we have confirmed that the Hilbert action is the correct term for the dynamics of 
the metric, as well as that 

S = J (fx{^M + n^H) (6.8.53) 

is the correct action for scalar fields and gravitation. 

6.8.3 Non-Coordinate Bases 

We would now like to move on to writing a Lagrangian coupling fermions to space- 
time via general relativity so as to give fermions gravitational interactions. Doing this, 
however, requires a bit of formalism first. 



Thinking back to chapter 2.3 recall that part of our definition of an n-dimensional 
differentiable manifold M. was that, on some patch of the manifold that is (locally) home- 
omorphic to and open subset of M", we can define coordinate functions x which map 
points p & M. into M": 



p e M — > x(j)) e 



(6.8.54) 



Of course it may not always be possible to cover the entirety of Ai with a single coordinate 
neighborhood, and therefore there may be neighborhoods on Ai that are covered by two 
different coordinate functions - say x and y. We then demanded that the function 



y(x ^{p)) : x(jj) — > y{p) 



(6.8.55) 



be infinitely differentiable and have infinitely differentiable inverse. This was the primary 



content of section 2.3.2 



Then in section 2.3.3 and following we discussed the tangent space and cotangent 
space, which were copies of M", which were attached at every point of Ai. We then 
showed that in some neighborhood of Ai (that is homeomorphic to an open subset of M"), 
we can use the coordinate functions to define frames, or bases, for these spaces. Namely, 



(6.8.56) 
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spans the tangent space, whereas 



dx' (6.8.57) 
spans the cotangent space. Of course these bases satisfy 

We have been able to make considerable progress considering such frames, or bases, 
of tangent spaces. However, there is no a priori reason from choosing a tangent space basis 
that coincides with the coordinates in this way. As you should be well aware from linear 
algebra, any basis of M" is just as good as any other, and we are therefore free to choose 
whatever basis we want. 

With that said, we will therefore consider tangent and cotangent space bases that are 
not based on the coordinate functions - such frames are aptly called Non-Coordinate 
Bases. Returning to the notation where a greek index indicates a spacetime index, let's 
make the switch from a basis ^ to some other basis. We'll call the new basis e^, where 
the latin index (instead of a greek index) is because this basis doesn't have any relation to 
the spacetime coordinates. Of course, because all bases are equally good as any other, we 
can write the new basis at any point as a linear combination of the coordinate basis. In 
other words we can write 

e. = e,- A (6.8.59) 

When written this way, the non-coordinate basis has both a spacetime index and a 
non-coordinate basis index (more on this later). The lower latin index labels the non- 
coordinate basis vector, while the upper greek index labels the spacetime (manifold) com- 
ponent of that basis vector. 

We can then demand that the non-coordinate basis be orthonormal at each point ac- 
cording to whatever orthonormal may mean on that manifold. For example, if we are 
working on a space with an Euclidian metric signatur^^ the inner product between any 
two of the non-coordinate basis vectors will be (cf equation ( 4.1.4[ >) 



g(ea,eb) = gf^ue^e^ = e^Cbi^ = Sab (6.8.60) 
whereas if the metric has a Lorentz signatur^^ this is 

g{ea,eb) = g^,ve'^el = (^af^bf, = Vab (6.8.61) 

The nomenclature typically used for such an orthonormal non-coordinate basis is a 
Vielbein. And, just as it isn't always possible to use a single coordinate system to cover 
a manifold and therefore isn't always possible to use a single coordinate basis for every 



^"This doesn't necessarily mean it has a Euclidian metric - it merely means that each of the diagonal 
components of the metric have the same sign, positive or negative. 
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Simply meaning that the diagonal elements don't have the same sign. 
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point on the manifold, it may not be possible to use a single vielbein to cover an entire 
manifold. We will therefore need to talk about transformation laws between vielbein's on 
coordinate neighborhood overlaps. 

Note that a vielbein is a very simple idea - we haven't done anything profound here. 
All we're doing is renaming things, nothing more. Make sure you understand the simple 
nature of what we've done in this section so far before moving on - if it seems complicated 
at all then you're missing something. 

Moving on, we can of course invert (6.8.59 1;^ 



B 

' -l\a H 



_d_ 
dx'' 



(6.8.62) 



where we have introduced the inverse vielbein, which has the spacetime lowered and the 
non-coordinate index raised (the opposite of the vielbein). The inverse obviously satisfies 

-l\b _ xb 



St". 



and 



(6.8.63) 



We can also use the inverse vielbein to invert ( |6.8.60 > and ( |6.8.61 >: 



5ab =^ g,.e^Me-'Ue-% 



9al3 



Sabie-'Taie'X 



(6.8.64) 



or 



-l\a , 



-l\b 



(6.8.65) 



These are extremely powerful relationships between the vielbein's and the spacetime met- 
ric - namely they make obvious that the metric can be written in terms of the vielbein and 
the flat space metric exactly. 

Furthermore, the inverse vielbein allows us to write a non-coordinate basis for the 
cotangent space: 



~l\a 



dx^ 



(6.8.66) 



Or inverting this. 



l\a 



(6.8.67) 



Obviously this preserves the relationship (6.8.58): 



6i: = dxf" 



d 
dx'^ 



lu^b 



St: 



(6.8.68) 



^^This of course assumes that e{J, which forms an n x n matrix, is invertible - we will assume this from 
now on. 
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We will from now on drop the —1 notation from the inverse vielbein. It should be 
understood that when the greek index is lowered and the latin index is raised it is inverse, 
whereas when the greek is raised and the latin is lowered it is not the inverse. 

Next, note that we can express any arbitrary vector in terms of the vielbein. Consider 



the vector (in terms of the coordinate basis) v = v^J^. In addition to switching the 



coordinate basis as in ( 6.8.62[ >, we can write the components of v as 



v^" — >v'' = e^v" (6.8.69) 

The vector is then 

V = V^Ga (6.8.70) 

Of course plugging in the known values for each of these terms recovers the original 
vector: 

V = v^Ba = v^elel— = v>'— (6.8.71) 
We can do this same thing for an arbitrary tensor. For example, we can write 

TZ-- = ■ ■ ■ <4 ■ ■ ■ T^::: (6-8.72) 

We want to reiterate at this point that there is nothing profound or even particularly 
interesting about what we are doing. We are merely relabeling things - nothing more. 

However, we are now able to begin to get into the real meat of using a non-coordinate 
basis, or a vielbein - transformation laws. What is particularly interesting is that, because 
the vielbein is completely independent of the spacetime manifold coordinates, it can be 
transformed independently of the spacetime coordinates. In other words, we can trans- 
form the latin indices without worrying about the greek indices. The meaning of this 
is that we're changing the vielbein without changing the coordinate functions. Keep in 
mind that a vielbein makes no reference whatsoever to the spacetime manifold coordi- 
nates. And therefore at a given point on the spacetime manifold, in choosing a vielbein, 
we are choosing a completely arbitrary set of basis vectors that don't have anything what- 
soever to do with the coordinates. Therefore, changing, or transforming, to some other 
completely arbitrary set of basis vectors that also have nothing whatsoever to do with the 
coordinates doesn't require that we think about the coordinates. In a sense, by working 
with the vielbein we have "lifted" ourselves from the manifold into another space. 

To see what this space i s, keep in mind that the only rule we are imposing for the 



vielbein is equation ( 6.8. 61p - the vielbein must be orthonormal at each point. So, the 
only constraint is that the vielbein we transform to must be orthonormal. Therefore, we 
can make absolutely any transformation on the vielbein that preserves the relationship 



( 6.8.61 ). In other words, we are allowed any transformation that preserves the Minkowski 
metric - and we know exactly what types of transformations these are - Lorentz transfor- 
mations! So, given any vielbein at any point (which, of course, has nothing to do with the 

^^From now on we will not make reference to the Euclidian signature version, but rather talk about the 
Lorentz signature version only. The Euclidian analogue is obvious. 
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spacetime coordinates at that point), we can transform that vielbein to any other vielbein 
as long as the transformation is a Lorentz transformation on the latin vielbein indices. So, 
denoting Lorentz transformations on the vielbein as A^, this means 

e„ ^ e; = Ale, (6.8.73) 



and in order to preserve (6.8.63 1 we have 

e'^ — . e"^ = (yl-^)^e^ (6.8.74) 

And, as we have said repeatedly, no such transformation has any affect on the actual 
spacetime coordinates, and therefore no spacetime indices need to be transformed. In 
other words, these Lorentz transformations are completely internal. For example in 



(6.8.62), this internal Lorentz transformation will take 



c 



= ejefe 

= ^ (6.8.75) 

So, indeed it is the case that transformation on this internal space have no affect on any- 
thing in spacetime. Furthermore, it should be clear that the internal space we are working 
with is specifically the space of all Lorentz transformations on the vielbein space! In other 
words, it is as if we have attached a copy of the Lorentz group 5*0 (1, 3) to every point 
on the spacetime manifold, and so by performing a Lorentz transformation on the viel- 
bein at every point, we are assigning an element of the Lorentz group to every element of 
spacetime. In other words, what we have is a local, or gauged, Lorentz group. 

Furthermore, just as the internal vielbein transformations don't have any affect on co- 
ordinate transformations, transformations that take place on the spacetime indices don't 
have any affect on the vielbein coordinates. Speaking more physically, as we have said 
the vielbein at a particular point has absolutely no relation to the coordinates. Therefore 
changing the coordinates should obviously have no affect on the vielbein. Such a space- 
ti me coor dinate transformation will only act on the greek spacetime indices. So, looking 



at (6.8.591, we have for arbitrary coordinate transformation 



dx""- 

„ d ,„ d „dx''^ dx" d 

p/^ i c — o'^ 



e — > e = e 

" dxi" " dx'i" " dx"" dx't" dx"' 

d 

''"dx- 



"dx" 

= ea (6.8.76) 

As interesting as all this may be, it is still the case that we've done little more than 
complicated our notation. We have demonstrated via transformation laws that there is 
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a rich geometrical structure in using the vielbein's (namely attaching a copy of S0{1, 3) 
to every point on the manifold), but anything we can write with spacetime indices we 
can write with vielbein indices and vice versa. However, the real divergence in the two 
notations comes when we try to take derivatives. 

As usual, when working with arbitrary manifolds there is no automatic way to com- 
pare tensors in two different tangent spaces, and we therefore must include a connection 
term to form the covariant derivative. But consider the vector in vielbein coordinates 
rather than spacetime coordinates. This takes on values in vielbein space 5*0(1, 3), not in 
spacetime. Therefore, while the covariant derivative will still need a connection, it clearly 
can't simply be the spacetime metric's Levi-Civita connection F^^. Instead, we introduce a 
connection that "lives" on the vielbein space, cu^^ (note that it has one spacetime index and 
two internal vielbein indices). Because this is a connection with reference to the internal 
space of Lorentz transformations, which are actually just rotations, we call this connec- 
tion the Spin Connection. Then, we take the covariant derivative of v"- using the spin 
connection: 

DX = dX + ^>'' (6.8.77) 



Or, more generally (cf (4.2.421) 



= ^M^:: + ^InJ^^:: + ^nTT' + ■■■- - - ■ ■ ■ (6.8.78) 

Or, if we had an expression with both a greek spacetime index and a latin internal 
vielbein index, like V"'"', the covariant derivative would be 

= d^V"" + Tl^V'"' + ujl^y"^ (6.8.79) 

where V^^ is the Levi-Civita connection. 

Now, the whole point of a tensor quantity is that it doesn't depend on how it is written 
- this ties into the fundamental law of physics that physics shouldn't depend on how we 
choose to describe the universe. Therefore there should be some relationship between the 
spin connection and the Levi-Civita connection. In other words, there should be a way to 
relate the following covariant derivatives 

D^v'' = d^v^ + ujI^^v'' (6.8.80) 

because they are saying the exact same things. 

To find this relationship, first consider a vector field written in out in an index free 
fashion: 

d 



dx 

{d.v'' + T'^^v'^) rfx^ ® ^ (6.8.81) 



Now do the same thing but use a vielbein index on the vector: 

Vv = {D^v") dx^ ® Ga 



{dX + w>') dx^ ® ea (6.8.82) 
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d 



Now transform the vielbein index back into a spacetime index: 

= {d^v- + v^e^d.el + e^e^u;>'^)cix'^ ® ^ (6.8.83) 

Then, because the tensorial nature of physics demands that be the same no matter 
how we describe things, we must have 

From this we get 

V = + ^^Hb^" (6.8.85) 
or dropping the v'^ common to each term, 

r^, = e'^d.el + elelul, (6.8.86) 

Or, inverting this, 

ul, = elelVl, - eld.el (6.8.87) 
So, the complete covariant derivative of a vector field v^^ written in the vielbein basis 

is 

D,v^ = d,v^ + uj;,v' = d.if + {eyiVl^ - eld,el)v' (6.8.88) 

Recall that the point of a covariant derivative is to ensure that the derivative term 
transforms covariantly, or in a nice tensorial way. This allows us to see what the trans- 
formation law for the connection must be. Under a spacetime coordinate transformation 
it should be clear that there will be no problems - the derivative isn't acting on anything 
with a spacetime index and therefore we won't get any additional terms. This tells us that 
the spacetime index on the spin connection uj'^^^ transforms tensorially. 

However, under an internal vielbein Lorentz transformation, there will be an addi- 
tional term because the partial derivative acts on the v"-, which has an internal latin index. 
So, assuming that the transformation law for the spin connection is a tensorial term plus 
a non-linear term (as usual), it will be 

<6 <6 = K^t^U + ^::;> (6-8.89) 

(where is the term we want to find), we can write the transformation for the covariant 
derivative as 

= Kl{D,v')+v\d,Kt + AlQl,) (6.8.90) 
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So, we get a covariant transformation as long as the last term vanishes: 





or 



And so the spin connection transforms according to 



fib 



UJ' 



-l\b' 



fib 



(6.8.91) 



(6.8.92) 



(6.8.93) 



under an internal Lorentz transformation. 

We now have the tools to discuss how fermions can be coupled to gravity. 

6.8.4 Gravity Coupled to Fermions 

We have done the bulk of the work necessary to couple fermions to spacetime in the 
previous sections, and we will therefore treat this topic briefly. We know that after gaug- 
ing the symmetry and introducing a covariant derivative we will need a field strength. 



We worked out what this will be in section 6.8.2 - the Hilbert Lagrangian 



\9\R 



(6.8.94) 



where R is the Ricci curvature scalar. 

Next, we look at the Lagrangian for a Dirac fermion (including the invariant volume 
form part): 



D 



(6.8.95) 



As discussed above in the Review of Gauge Theory part of section 6.8.1 the gauge 
group for general relativity will be the Lorentz group. We'll walk through the usual steps 
in gauging this symmetry with hopes that doing so makes our exposition easier to follow. 

The fermions will transform under the spinor representation of the Lorentz group (cf 
l|40| , section 3.1.5), called the spinor representation, and we denote a general Lorentz 
transformation in the spinor representation as 5 (A): so 

tfj — > S{A)ij and — >i)S-^{A) (6.8.96) 

So, under a global Lorentz transformation the Dirac Lagrangian ( 6.8.95[ > will transform as 

= (6.8.97) 
So, =2/) is indeed invariant under a global Lorentz transformation. 
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However, if we gauge the symmetry and let 5'(A) depend on spacetime (a local Lorentz 
transformation), we have (suppressing the (A) for notational simplicity) 



= V|^|V'(^7''9^ - m)^p — > ^/\g\^S^\iY^^, - m)Silj 

= ^M^S-\tYSd^ + t^id^S) - m)ij (6.8.98) 

which as expected has a non-linear term that we need to cancel. We achieve this cancella- 
tion by introducing a connection, or a gauge field, via a covariant derivative. We assume 
the form of this covariant derivative to be 

D^ = d^ + (6.8.99) 

where fi^ is the connection we have yet to determine. 

However it is at this point that we need to be careful. The close analogy we have 
been maintaining between the differential geometry developed in this paper the gauge 
theory formalism we developed in the previous paper would indicate that because we 
are coupling the fermions to spacetime, we should use the connection of spacetime, the 
Levi-Civita connection F^^, as our gauge field. However this turns out to be incorrect. 

The reason this is incorrect is that the Levi-Civita connection acts on vectors with a 



spacetime index. For example in section 4.2.2 (i.e. equation (4.2.371) the Levi-Civita con 



nection was being used to form the "gauge field", or covariant derivative, for a vector 
with indices relating to the coordinates of the manifold. 

But our fermion fields ip and ip don't have spacetime manifold indices. As we dis- 
cussed in ||40ll , the degree of freedom of a fermion, called spin, is not a rotation through 
spacetime. Rather, it is a rotation through "spinor space". This is an internal degree of free- 
dom. Therefore the Levi-Civita connection is not the tool we need (at least, not exactly). 

However, as you no doubt have realized, this idea of an internal degree of freedom can 
be handled extremely well with the vielbein formalism developed in the previous section 
( |6.8.3[ ). The essential idea of that section was to introduce a vielbein, or basis at each 



point that was unrelated to the coordinates of the spacetime manifold, and by writing 
quantities in terms of the vielbein, we were working with an internal space. This lead to 
the notion of a spin connection which allowed us to form a covariant derivative for objects 
that had no spacetime index but rather had an internal index. And that is exactly what we 
have with fermions! They have no spacetime index, but they have an internal spin index. 



Therefore, by simply replacing from section 6.8.3 with the spinor representation of the 
Lorentz group, 5^, we can form a covariant derivative for ^. 

First, we note that like any transformation, we can write a Lorentz transformation S as 
the exponentiation of the Lorentz algebra generators (where the generators are the spinor 
representation generators), which we denote T."-''. So 

S = e^""-"^"' (6.8.100) 

(the factor of 1/2 is conventional) where aab are the parameters for the Lorentz transf or- 
mationP^ 

^*It is straightforward to show that both S"'' will be an antisymmetric 4x4 matrix, which has 6 real 
independent components corresponding to three rotations and three boosts. 
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Then, working with the spin connection for our internal spinor space, we expand the 
connection in terms of the generators (from (6.8.99)): 



(6.8.101) 



(where we have lowered one of the indices on the generator matrix to keep our notation 
on the spin connection consistent with the previous section - this is no problem because 
we can of course raise and lower indices at will with the metric. 
So, our covariant derivative is 



(6.8.102) 



We leave it to you to show that replacing the partial derivative in J^d with (6.8.102 1 along 
with the transformation for the spin connection ( 6.8.93[ > exactly cancels the non-linear 
term, making with the spin connection covariant derivative invariant under the local 
Lorentz transformation. 

So, finally, we can write out the action for fermions coupled to gravity: 



S 



d'^x^/\g\[iJ{i'J^Df^ - m)ip + kR) 
d'x^\{i,{ir{d, + - m)^ + kR) 



(6.8.103) 



Or for a theory containing both scalar fields and bosons. 



S 



d x^\g 



(6.8.104) 



6.9 References and Further Reading 

The primary general sources for this chapter were IjU and |fll|. Most of the discussions 
of the meaning of the various concepts came from 0. The discussions of the equivalence 
principle and Newtonian correspondences came from (81. The discussions of solutions 
to the geodesic equations came from |[1T|. We following [12] to find the Schwarzschild 
solution. We followed [18] and [48] in discussing the stress tensor and energy momentum 
tensor, and [29] in deriving the modern formulation of the energy momentum tensor. For 
general relativity as a gauge theory we used [8] and Il37l . 

For further reading in special relativity we recommend HI, lUSlI , Il35ll , and ||50| . For 
further reading in general relativity we recommend [|6j, ||20l, ||33l, EH, ||49l, and IISTfl 
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Chapter 7 



Concluding Thoughts on Geometry, 
Topology, Gauge Theories, and Gravity 



We'll conclude with a few comments relating to the gauge theory of general relativity 
that, while not necessary to understanding Einstein's class theory or the derivation or 



meaning of (6.8.1041, will help tremendously in preparing us for where we are going with 
this series. 

You may have noticed that we did a slight slide of hand in our derivation of the spin 



connection in the previous section. Specifically, on page 375 we pointed out that the 



similarity between the "internal" spinor space and the "internal" vielbein space of section 



6.8.3 (in the paragraph beginning with "However, as you no doubt ..."). But, recall that the 
idea of a vielbein was that forms a basis for the tangent space of the manifold. They are 
nothing more than spacetime vectors defined at a point - and obviously the spinor space 



is not the same as a tangent space. So we made a fundamental change from section 6.8.3 
to section [6M 



Recall that in section 6.8.3 we commented that by introducing the vielbein we have, 
in essence, attached a copy of the Lorentz group 5*0(1, 3) to every point on the spacetime 
manifold A^. However, this additional space arose naturally, even necessarily, from the 
fact that the basis for the tangent space at a point is completely arbitrary and any basis 
will work as well as any other, and therefore the 5*0(1, 3) in this framework acts on the 
basis vectors, and is therefore the vector representation of 5*0(1, 3), not the spinor repre- 
sentation. However with the vielbein that lead to the spin connection, the internal indices 
are not simply elements in the tangent space - they are in spinor space. And, unlike the 
tangent space which arises necessarily from the manifold, the addition of a spinor space 
is somewhat artificial - there is no pre-existing reason to add a spinor space to a manifold. 

So, let's take stock of what we have. We have a manifoldH and naturally associated 
with the manifold is a tangent space and a cotangent space. Then there are tensors of 
various ranks that can live in the tangent and cotangent spac^ In order to take the 
derivatives of tensor fields on the manifold we must be able to compare tensors in dif- 
ferent tangent spaces. We do this by introducing the Levi-Civita connection through the 



^In physical language this is spacetime. 
^These are our physical fields. 
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covariant derivative. The covariant derivative then replaces the partial derivative for dif- 
ferentiating tensor fields that live in the vector and covector space of the manifold. Also, 
as a direct result of the structure of the tangent space at every point of the manifold, we 
have a symmetry associated with our ability to choose any basis of the tangent space at 
each point. All that is required is that the basis be orthonormal - this leads to the symme- 
try being 50(1, 3) (or SO{n, m) for arbitrary manifold with Lorentz signature and SO{n) 
for arbitrary manifold with Euclidian signature). We can treat this 5*0(1, 3) etc. as an in- 
ternal degree of freedom, but again this degree of freedom has arisen very naturally from 
the structure of the manifold - no new ideas are introduced in arriving here. Then, as we 
saw, we can express any spacetime tensor, rather than in terms of the manifold coordi- 
nates, in terms of this internal "vielbein" space. Again, doing this doesn't require any new 
ideas - this has all risen naturally from the structure of the manifold. 

On the other hand, as we have shown via the spin connection, we can go through all of 
this again but in a way that doesn't arise naturally from the structure of the manifold. We 
can start with some manifold with all the same things as above (tangent and cotangent 
spaces, tensors, a tangent space induced vielbein space, etc.). But, we may then introduce 
something other than merely the transformation group on the tangent space basis vectors. 
We may introduce, say, the spinor representation of 50(1, 3) as we did in the previous 
section, or we may introduce absolutely any other space we want. We may write a basis 
at each point for our "artificial" space we've attached. We can then express any element 
of this space in terms of the basis we have chosen at each point. 

So we have two different types of "things" attached to our manifold. One is the tan- 
gent/ cotangent space structure which arises naturally and is automatically built in to our 
geometry. The other is a somewhat "artificial" space we attach to the manifold. In both 
cases we can talk about fields "living" in a vector space that is defined at every point of 
the manifold. Introducing some new notation, we will refer to the former the Tangent 
Bundle (which consists of the tangent space, the cotangent space, and all the higher di- 
mensional tensor spaces resulting from this in the natural way), and the latter "artificial" 
spaces will be referred to as Fibre Bundles. 



Looking back to page 44 in section 2.3.3 we briefly mentioned this idea. Specifically, if 



our manifold is M, the tangent bundle TM is the product M ®MP (cf equation ( |2.3.2 )). For 



these more general fibre bundles we are introducing now, if the space we are artificially 
attaching is denoted Q, the total space is (sort oQ M. ® Q. And, just as an individual 
element of the tangent bundle is a single tangent space at a point, a single element of the 
fibre bundle is a single fibre at a point. So, the copy of ^ at p G is the fibre at p. The 
underlying manifold M. is called the Base Space of the total structure, which we call the 
Total Space. 

Another important point is that the total spaces, consisting of the base space, the tan- 
gent bundle, and the fibre bundles, can genuinely be viewed as single larger spaces. As 
we will see, the details on how everything works together can be very complicated, but 
it is still the case that these total spaces are well defined geometries that we can view as a 



■'This "sort of" is enormously important and we will discuss it in much, much, much greater depth in the 
next paper in this series. In reality the total space is only locally M.®Q- but more on that (including what 
that means) later. 
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single geometrical object. 

We also mentioned on page 44 the idea of a projection map. The idea was that, at 
a given point p E M there is a copy of the tangent space M", and therefore any tangent 
vector at p, denoted {p, v), can be "projected" to the point of M the vector is attached to 
- or in other words v is projected to the point of M such that v G TpM. We call this 
projection map vr: 

TT : TpM — > M 

(p,v) ^ p (7.0.1) 

Then, we can use the inverse of vr at every point to recover the entire tangent space: 



TT 



-1 



(p) = TpM (7.0.2) 



We will make extensive use of the same idea for general fibre bundles. At every point 
p e M we can attach the vector space Q. Then, some element g E G cane be projected 
down to the point to which it is attached with tt, and from p E M. we can recover the 
entire fibre: 

n-\p) = g (7.0.3) 

Again, we want to emphasize that the difference between what we are calling the tan- 
gent bundle and fibre bundle is that the tangent bundle is intimately associated with the 
base space manifold, whereas the fibre bundle is not - the fibre bundle is defined entirely 
independent of the base space. So, a total space S with base manifold M, tangent bundle 
TM, and fibre bundle Q may be writterij^ 

S = M®TM®g (7.0.4) 

Or, if we want, we can attach multiple fibre bundles: 

S = M0 TM ® ^1 ® ^2 ® ■ ■ • (7.0.5) 

You can likely imagine that the total space 

S = M^^ TM^ ® SU{3) ® SU{2) ® f/(l) (7.0.6) 

which is of course viewed (in this paradigm) as a single geometrical space, is particularly 
interesting. 



As we noted in section 6.8.3 an arbitrary vector with spacetime indices can be writ- 
ten in terms of the vielbein indices, f ° = w '^e^. This is a direct consequence of the fact 
(that we have repeated several times in this section) that the vielbein space is a direct 
consequence of the manifold itself and is intimately tied to it. For an arbitrary fibre bun- 
dle, however (as we have also repeated several times in this section), there is absolutely 
no necessary correlation whatsoever between the geometry of the base manifold and the 

^Again, the G part of this may only locally have a Q attached to it in this way - for now don't worry about 
it. We just want to mention this for readers who already have some familiarity with these concepts. 
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geometry of the fibre. Therefore it isn't in general possible to write a spacetime vector in 
terms of the basis for the fibre. This brings us to the notion of two fundamentally different 
types of fields (cf the comparison between gravity and electromagnetism starting on page 
^in section |6AT| ) 



The first type of field are the fields we have been discussing all along in these notes 
- tensor fields that have spacetime indices. These are fields that live in the TA^ and the 
higher dimensional tensor spaces that we form naturally from TAi. Vectors, forms, ma- 
trices, and so on - anything with a spacetime index is such a field. Again, these are fields 
that "live" in the tangent and cotangent spaces, which (again) are spaces that are deeply 
and intimately linked to the manifold itself. 

The second type of field are fields that "live" in fibre bundles. These are fields that 
don't necessarily carry any spacetime indices and therefore can't be expressed as tensors 
with spacetime indices. These are fields that don't arise as a result of any natural structure 
on the base manifold - they must be put "on top of" the manifold. 

We already discussed this distinction a bit in sections 6.4.1 and page 364 of section 



6.8.2 The content of the equivalence principle can be stated more mathematically by 



simply saying that gravity is entirely a result of things relating to the first type of field. 
On the other hand, as we mentioned in section 6.4.1 there is no equivalence principle for 
the other forces, like electromagnetism. This can be stated more mathematically by simply 
saying that the other forces (like electromagnetism) are the result of things relating to the 
second type of fields. 

As usual, we are skimming the peaks of mountains here. There is much, much more 
we can say about all of this, and indeed much more that we will say. The notion of 
fibre bundles over manifolds and the types of structures that can be built out of and on 
the resulting total spaces provides one of the richest and most profound areas in all of 
mathematics and physics, as well as providing a way of revealing deep relationships 
between gauge theories and general relativity and between geometry and topology. 

And so it is here that we conclude these notes. We have outlined the geometrical and 
topological ideas necessary to make sense (at least at an introductory level) of general rel- 
ativity and several aspects of electromagnetic theory. And, by applying the mathematical 
ideas to the physical situations we have, we have gained a shadowy glimpse into much 
fuller and richer structure of fundamental physical theories. The next paper in this series 
will begin to sharpen this glimpse, and by its end we should have a much more solid 
grasp of the profound relationship between physics and mathematics. 
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